Skip to content
Change the repository type filter

All

    Repositories list

    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      8266.2k4514Updated Feb 27, 2026Feb 27, 2026
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      1k9.7k11728Updated Feb 25, 2026Feb 25, 2026
    • awesome-deepseek-integration

      Public
      Integrate the DeepSeek API into popular software
      Creative Commons Zero v1.0 Universal
      4k36k19Updated Feb 23, 2026Feb 23, 2026
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      1.1k9k16655Updated Feb 9, 2026Feb 9, 2026
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      MIT License
      99213k6130Updated Feb 6, 2026Feb 6, 2026
    • DeepSeek-OCR-2

      Public
      Visual Causal Flow
      Python
      Apache License 2.0
      1912.4k444Updated Feb 3, 2026Feb 3, 2026
    • DeepSeek-OCR

      Public
      Contexts Optical Compression
      Python
      MIT License
      2.1k23k24637Updated Jan 27, 2026Jan 27, 2026
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
      Python
      MIT License
      3162.9k32Updated Jan 14, 2026Jan 14, 2026
    • Engram

      Public
      Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
      Python
      Apache License 2.0
      2583.8k105Updated Jan 14, 2026Jan 14, 2026
    • DeepSeek-Math-V2

      Public
      Python
      Apache License 2.0
      1371.6k81Updated Dec 1, 2025Dec 1, 2025
    • LPLB

      Public
      An early research stage expert-parallel load balancer for MoE models based on linear programming.
      Python
      MIT License
      3349910Updated Nov 19, 2025Nov 19, 2025
    • DeepSeek-V3.2-Exp

      Public
      Python
      MIT License
      1431.5k205Updated Nov 18, 2025Nov 18, 2025
    • awesome-deepseek-coder

      Public
      A curated list of open-source projects related to DeepSeek Coder
      20976900Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-Coder

      Public
      DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.7k23k12426Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-Coder-V2

      Public
      DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      1k6.5k627Updated Nov 11, 2025Nov 11, 2025
    • DeepSeek-V3

      Public
      Python
      MIT License
      17k102k4455Updated Aug 28, 2025Aug 28, 2025
    • DeepSeek-Prover-V2

      Public
      Other
      941.2k112Updated Jul 18, 2025Jul 18, 2025
    • DeepSeek-R1

      Public
      MIT License
      12k92k2725Updated Jun 27, 2025Jun 27, 2025
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      26172951Updated May 22, 2025May 22, 2025
    • open-infra-index

      Public
      Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      2878k00Updated May 15, 2025May 15, 2025
    • DreamCraft3D

      Public
      [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3593k341Updated Apr 22, 2025Apr 22, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      2011.4k64Updated Mar 24, 2025Mar 24, 2025
    • profile-data

      Public
      Analyze computation-communication overlap in V3/R1.
      1451.1k120Updated Mar 21, 2025Mar 21, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      MIT License
      4434.9k229Updated Mar 5, 2025Mar 5, 2025
    • DeepSeek-VL2

      Public
      DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.8k5.2k10019Updated Feb 26, 2025Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k18k15627Updated Feb 1, 2025Feb 1, 2025
    • DeepSeek-V2

      Public
      DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      5345k813Updated Sep 25, 2024Sep 25, 2024
    • DeepSeek-Prover-V1.5

      Public
      Python
      MIT License
      23555381Updated Aug 16, 2024Aug 16, 2024
    • DeepSeek-VL

      Public
      DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      MIT License
      5844.1k444Updated Apr 24, 2024Apr 24, 2024
    • DeepSeek-Math

      Public
      DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      MIT License
      5703.2k374Updated Apr 15, 2024Apr 15, 2024