    Repositories list

    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      1.3k9.7k18374 Updated May 26, 2026
    • awesome-deepseek-agent

      Public
      2822.6k5074 Updated May 23, 2026
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      1k7.3k4924 Updated May 13, 2026
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      1.1k9.9k11833 Updated May 7, 2026
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      MIT License
      1k13k6636 Updated Apr 30, 2026
    • TileKernels

      Public
      A kernel library written in tilelang
      Python
      MIT License
      1311.6k67 Updated Apr 23, 2026
    • awesome-deepseek-integration

      Public
      Integrate the DeepSeek API into popular software
      Creative Commons Zero v1.0 Universal
      4.1k38k3048 Updated Feb 23, 2026
    • DeepSeek-OCR-2

      Public
      Visual Causal Flow
      Python
      Apache License 2.0
      2512.9k504 Updated Feb 3, 2026
    • Contexts Optical Compression
      Python
      MIT License
      2.1k23k24838 Updated Jan 27, 2026
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
      Python
      MIT License
      3253k32 Updated Jan 14, 2026
    • Engram

      Public
      Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
      Python
      Apache License 2.0
      3364.4k146 Updated Jan 14, 2026
    • Python
      Apache License 2.0
      1471.6k81 Updated Dec 1, 2025
    • LPLB

      Public
      An early research stage expert-parallel load balancer for MoE models based on linear programming.
      Python
      MIT License
      3650510 Updated Nov 19, 2025
    • Python
      MIT License
      1731.6k227 Updated Nov 18, 2025
    • A curated list of open-source projects related to DeepSeek Coder
      21378700 Updated Nov 11, 2025
    • DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.8k24k13428 Updated Nov 11, 2025
    • DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      1.1k6.8k657 Updated Nov 11, 2025
    • Python
      MIT License
      17k104k16659 Updated Aug 28, 2025
    • Other
      1001.3k112 Updated Jul 18, 2025
    • MIT License
      12k92k1924 Updated Jun 27, 2025
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      26373561 Updated May 22, 2025
    • Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      2878k01 Updated May 15, 2025
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3583k341 Updated Apr 22, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      2011.4k64 Updated Mar 24, 2025
    • Analyze computation-communication overlap in V3/R1.
      1471.2k130 Updated Mar 21, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      MIT License
      4425k239 Updated Mar 5, 2025
    • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.8k5.3k10120 Updated Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k18k15926 Updated Feb 1, 2025
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      5415k833 Updated Sep 25, 2024
    • Python
      MIT License
      23957281 Updated Aug 16, 2024