HPC-AI Tech

All

32 repositories

HPC-AI-SDK
Public
HPC-AI TECH 's Fine-tuning SDK
Python
•
Apache License 2.0
•3•0•0•0•Updated Jan 6, 2026Jan 6, 2026
ColossalAI
Public
Making large AI models cheaper, faster and more accessible
ai deep-learning hpc distributed-computing inference big-model large-scale data-parallelism model-parallelism pipeline-parallelism
Python
•
Apache License 2.0
•4.5k•41k•436•46•Updated Dec 22, 2025Dec 22, 2025
public_assets
Public
Storing publicly available assets such as images, animations and texts
Python
•
Apache License 2.0
•19•14•0•0•Updated Nov 28, 2025Nov 28, 2025
TensorRT-LLM
Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
C++
•
Apache License 2.0
•2k•1•0•0•Updated Oct 13, 2025Oct 13, 2025
TensorRT-Model-Optimizer
Public
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
Python
•
Apache License 2.0
•231•1•0•1•Updated Oct 12, 2025Oct 12, 2025
ColossalAI-Documentation
Public
Documentation for Colossal-AI
JavaScript
•
Apache License 2.0
•12•23•4•1•Updated Jun 6, 2025Jun 6, 2025
Open-Sora
Public
Open-Sora: Democratizing Efficient Video Production for All
Python
•
Apache License 2.0
•2.8k•28k•1•7•Updated Apr 30, 2025Apr 30, 2025
Open-Sora-Demo
Public
0•8•0•0•Updated Mar 19, 2025Mar 19, 2025
Oh-My-Dockerfile
Public
A collection of dockerfiles for various tasks
Dockerfile
•
Apache License 2.0
•9•22•0•0•Updated Feb 20, 2025Feb 20, 2025
ColossalAI-Platform-CLI
Public
CLI for ColossalAI Platform
Python
•
Apache License 2.0
•2•10•1•0•Updated Feb 20, 2025Feb 20, 2025
TensorNVMe
Public
A Python library transfers PyTorch tensors between CPU and NVMe
deep-learning pytorch nvme colossal-ai
C++
•27•123•9•1•Updated Nov 27, 2024Nov 27, 2024
transformers
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•32k•11•0•0•Updated Nov 19, 2024Nov 19, 2024
LLaVA-NeXT
Public
Python
•
Apache License 2.0
•434•1•0•0•Updated Nov 18, 2024Nov 18, 2024
graphrag
Public
A modular graph-based Retrieval-Augmented Generation (RAG) system
Python
•
MIT License
•3.2k•1•0•0•Updated Aug 3, 2024Aug 3, 2024
FastFold
Public
Optimizing AlphaFold Training and Inference on GPU Clusters
gpu protein-structure cuda pytorch parallelism protein-folding alphafold2 evoformer habana-gaudi
Python
•
Apache License 2.0
•89•611•38•6•Updated Jul 16, 2024Jul 16, 2024
Cloud-Platform-Docs
Public
Documentation for our cloud platform
JavaScript
•3•1•0•1•Updated Apr 3, 2024Apr 3, 2024
CANN-Installer
Public
This repository contains Huawei Ascend CANN files
Apache License 2.0
•0•1•1•0•Updated Feb 27, 2024Feb 27, 2024
SwiftInfer
Public
Efficient AI Inference & Serving
deep-learning inference artificial-intelligence llama gpt llm-serving llm-inference llama2
Python
•
Apache License 2.0
•31•479•3•0•Updated Jan 8, 2024Jan 8, 2024
EnergonAI
Public archive
Large-scale model inference.
Python
•
Apache License 2.0
•85•627•40•2•Updated Sep 12, 2023Sep 12, 2023
pytest-testmon
Public
Selects tests affected by changed files. Executes the right tests first. Continuous test runner when used with pytest-watch.
Python
•
GNU Affero General Public License v3.0
•71•0•0•0•Updated Jul 25, 2023Jul 25, 2023
Elixir
Public
Elixir: Train a Large Language Model on a Small GPU Cluster
efficient memory-management large-language-models
Python
•5•15•0•1•Updated Jun 8, 2023Jun 8, 2023
ColossalAI-Examples
Public archive
Examples of training models with hybrid parallelism using ColossalAI
Python
•
Apache License 2.0
•102•339•29•2•Updated Mar 23, 2023Mar 23, 2023
mmdetection-examples
Public archive
Train mmdetection models with ColossalAI.
Python
•0•2•1•0•Updated Feb 18, 2023Feb 18, 2023
PaLM-colossalai
Public archive
Scalable PaLM implementation of PyTorch
Python
•
Apache License 2.0
•27•189•7•2•Updated Dec 19, 2022Dec 19, 2022
GPT-Demo
Public archive
GPT Demo with hybrid distributed training
Python
•
Apache License 2.0
•7•10•2•0•Updated Dec 1, 2022Dec 1, 2022
Titans
Public archive
A collection of models built with ColossalAI
Python
•
Apache License 2.0
•16•32•6•1•Updated Nov 22, 2022Nov 22, 2022
ColossalAI-Pytorch-lightning
Public
Python
•6•24•0•0•Updated Nov 22, 2022Nov 22, 2022
OPT-Benchmark
Public archive
Python
•
Apache License 2.0
•5•4•1•0•Updated Nov 22, 2022Nov 22, 2022
CachedEmbedding
Public
A memory efficient DLRM training solution using ColossalAI
nlp deep-learning pytorch embeddings dlrm recommandation-system colossal-ai
Python
•
Apache License 2.0
•14•106•2•0•Updated Nov 22, 2022Nov 22, 2022
SkyComputing
Public archive
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
Python
•
Apache License 2.0
•21•90•1•0•Updated Nov 22, 2022Nov 22, 2022