Skip to content
Change the repository type filter

All

    Repositories list

    • cuda-python

      Public
      CUDA Python: Performance meets Productivity
      Cython
      2423.1k20617Updated Jan 27, 2026Jan 27, 2026
    • Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.
      Jupyter Notebook
      1123901513Updated Jan 27, 2026Jan 27, 2026
    • NeMo-text-processing

      Public
      NeMo text processing for ASR and TTS
      Python
      14241713Updated Jan 27, 2026Jan 27, 2026
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      2k13k505465Updated Jan 27, 2026Jan 27, 2026
    • accelerated-computing-hub

      Public
      NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      1991.1k145Updated Jan 27, 2026Jan 27, 2026
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.5k15k308280Updated Jan 27, 2026Jan 27, 2026
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3262.1k1.2k208Updated Jan 27, 2026Jan 27, 2026
    • cloud-native-docs

      Public
      Documentation repository for NVIDIA Cloud Native Technologies
      PowerShell
      3535413Updated Jan 27, 2026Jan 27, 2026
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      75375212205Updated Jan 27, 2026Jan 27, 2026
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1186838417Updated Jan 27, 2026Jan 27, 2026
    • cuEquivariance

      Public
      cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.
      Python
      24349143Updated Jan 27, 2026Jan 27, 2026
    • spark-rapids

      Public
      Spark RAPIDS plugin - accelerate Apache Spark with GPUs
      Scala
      2719591.8k34Updated Jan 27, 2026Jan 27, 2026
    • libredfish

      Public
      A Rust Crate for interacting with DTMF Redfish endpoints
      Rust
      111201Updated Jan 27, 2026Jan 27, 2026
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      32489842487Updated Jan 27, 2026Jan 27, 2026
    • kvpress

      Public
      LLM KV cache compression made easy
      Python
      9886331Updated Jan 27, 2026Jan 27, 2026
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2421.9k6368Updated Jan 27, 2026Jan 27, 2026
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      371653515Updated Jan 27, 2026Jan 27, 2026
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4961.8k6728Updated Jan 27, 2026Jan 27, 2026
    • cloudai

      Public
      CloudAI Benchmark Framework
      Python
      428236Updated Jan 27, 2026Jan 27, 2026
    • makani

      Public
      Massively parallel training of machine-learning based weather and climate models
      Python
      6335144Updated Jan 27, 2026Jan 27, 2026
    • mig-parted

      Public
      MIG Partition Editor for NVIDIA GPUs
      Go
      562412226Updated Jan 27, 2026Jan 27, 2026
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2321.7k20Updated Jan 27, 2026Jan 27, 2026
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2882.8k10141Updated Jan 27, 2026Jan 27, 2026
    • k8s-dra-driver-gpu

      Public
      NVIDIA DRA Driver for GPUs
      Go
      1135528724Updated Jan 27, 2026Jan 27, 2026
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4432.5k9366Updated Jan 27, 2026Jan 27, 2026
    • numba-cuda

      Public
      The CUDA target for Numba
      Python
      5524610236Updated Jan 27, 2026Jan 27, 2026
    • JAX-Toolbox

      Public
      JAX-Toolbox
      Python
      683818042Updated Jan 27, 2026Jan 27, 2026
    • nvidia-code-mgmt

      Public
      Non-PLDM firmware update infrastructure
      C++
      2500Updated Jan 27, 2026Jan 27, 2026
    • NV-Kernels

      Public
      Ubuntu kernels which are optimized for NVIDIA server systems
      5489019Updated Jan 27, 2026Jan 27, 2026
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4664k10522Updated Jan 27, 2026Jan 27, 2026