    Repositories list

    • HPC-AI-SDK

      Public
      HPC-AI TECH's Fine-tuning SDK
      Python
      3000 Updated Jan 6, 2026
    • Making large AI models cheaper, faster and more accessible
      Python
      4.5k41k43646 Updated Dec 22, 2025
    • public_assets

      Public
      Storing publicly available assets such as images, animations and texts
      Python
      191400 Updated Nov 28, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      C++
      2k100 Updated Oct 13, 2025
    • A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
      Python
      231101 Updated Oct 12, 2025
    • ColossalAI-Documentation

      Public
      Documentation for Colossal-AI
      JavaScript
      122341 Updated Jun 6, 2025
    • Open-Sora

      Public
      Open-Sora: Democratizing Efficient Video Production for All
      Python
      2.8k28k17 Updated Apr 30, 2025
    • 0800 Updated Mar 19, 2025
    • Oh-My-Dockerfile

      Public
      A collection of Dockerfiles for various tasks
      Dockerfile
      92200 Updated Feb 20, 2025
    • CLI for ColossalAI Platform
      Python
      21010 Updated Feb 20, 2025
    • TensorNVMe

      Public
      A Python library that transfers PyTorch tensors between CPU and NVMe
      C++
      2712391 Updated Nov 27, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
      Python
      32k1100 Updated Nov 19, 2024
    • LLaVA-NeXT

      Public
      Python
      434100 Updated Nov 18, 2024
    • graphrag

      Public
      A modular graph-based Retrieval-Augmented Generation (RAG) system
      Python
      3.2k100 Updated Aug 3, 2024
    • FastFold

      Public
      Optimizing AlphaFold Training and Inference on GPU Clusters
      Python
      89611386 Updated Jul 16, 2024
    • Cloud-Platform-Docs

      Public
      Documentation for our cloud platform
      JavaScript
      3101 Updated Apr 3, 2024
    • This repository contains Huawei Ascend CANN files
      0110 Updated Feb 27, 2024
    • Efficient AI Inference & Serving
      Python
      3147930 Updated Jan 8, 2024
    • EnergonAI

      Public archive
      Large-scale model inference.
      Python
      85627402 Updated Sep 12, 2023
    • pytest-testmon

      Public
      Selects tests affected by changed files. Executes the right tests first. Continuous test runner when used with pytest-watch.
      Python
      71000 Updated Jul 25, 2023
    • Elixir

      Public
      Elixir: Train a Large Language Model on a Small GPU Cluster
      Python
      51501 Updated Jun 8, 2023
    • ColossalAI-Examples

      Public archive
      Examples of training models with hybrid parallelism using ColossalAI
      Python
      102339292 Updated Mar 23, 2023
    • mmdetection-examples

      Public archive
      Train mmdetection models with ColossalAI.
      Python
      0210 Updated Feb 18, 2023
    • PaLM-colossalai

      Public archive
      Scalable PaLM implementation in PyTorch
      Python
      2718972 Updated Dec 19, 2022
    • GPT-Demo

      Public archive
      GPT Demo with hybrid distributed training
      Python
      71020 Updated Dec 1, 2022
    • Titans

      Public archive
      A collection of models built with ColossalAI
      Python
      163261 Updated Nov 22, 2022
    • ColossalAI-Pytorch-lightning

      Public
      Python
      62400 Updated Nov 22, 2022
    • OPT-Benchmark

      Public archive
      Python
      5410 Updated Nov 22, 2022
    • A memory-efficient DLRM training solution using ColossalAI
      Python
      1410620 Updated Nov 22, 2022
    • SkyComputing

      Public archive
      Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
      Python
      219010 Updated Nov 22, 2022