Skip to content
Change the repository type filter

All

    Repositories list

    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      31788340882Updated Jan 8, 2026Jan 8, 2026
    • holodeck

      Public
      Holodeck is a project to create test environments optimised for GPU projects.
      Go
      92133Updated Jan 8, 2026Jan 8, 2026
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1066438530Updated Jan 8, 2026Jan 8, 2026
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3172.1k1.1k213Updated Jan 8, 2026Jan 8, 2026
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1351.1k2663Updated Jan 8, 2026Jan 8, 2026
    • spark-rapids-jni

      Public
      RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7852856Updated Jan 8, 2026Jan 8, 2026
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2321.8k5762Updated Jan 8, 2026Jan 8, 2026
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.5k15k317248Updated Jan 8, 2026Jan 8, 2026
    • torch-harmonics

      Public
      Differentiable signal processing on the sphere for PyTorch
      Jupyter Notebook
      6362344Updated Jan 8, 2026Jan 8, 2026
    • cuEquivariance

      Public
      cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.
      Python
      24339146Updated Jan 8, 2026Jan 8, 2026
    • TensorRT-Incubator

      Public
      Experimental projects related to TensorRT
      MLIR
      221173712Updated Jan 8, 2026Jan 8, 2026
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      311483013Updated Jan 8, 2026Jan 8, 2026
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      2k13k517476Updated Jan 8, 2026Jan 8, 2026
    • doca-platform

      Public
      DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
      Go
      166700Updated Jan 8, 2026Jan 8, 2026
    • phosphor-dbus-interfaces

      Public
      YAML descriptors of standard dbus interfaces
      Meson
      77100Updated Jan 8, 2026Jan 8, 2026
    • recsys-examples

      Public
      Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
      Python
      41200409Updated Jan 8, 2026Jan 8, 2026
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      3055111Updated Jan 8, 2026Jan 8, 2026
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      6702311Updated Jan 8, 2026Jan 8, 2026
    • NeMo-Agent-Toolkit-UI

      Public
      The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application.
      TypeScript
      446386Updated Jan 8, 2026Jan 8, 2026
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7723.6k7143Updated Jan 8, 2026Jan 8, 2026
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4584k11922Updated Jan 8, 2026Jan 8, 2026
    • spark-rapids-tools

      Public
      User tools for Spark RAPIDS
      Scala
      47662653Updated Jan 8, 2026Jan 8, 2026
    • ais-k8s

      Public
      Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
      Go
      2511811Updated Jan 8, 2026Jan 8, 2026
    • earth2studio

      Public
      Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
      Python
      893241210Updated Jan 8, 2026Jan 8, 2026
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      74368210217Updated Jan 8, 2026Jan 8, 2026
    • pldm

      Public
      C++
      57502Updated Jan 8, 2026Jan 8, 2026
    • dbus-sensors

      Public
      D-Bus configurable sensor scanning applications
      C++
      59300Updated Jan 8, 2026Jan 8, 2026
    • cuda-q-academic

      Public
      This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.
      Jupyter Notebook
      7123928Updated Jan 8, 2026Jan 8, 2026
    • nsight-python

      Public
      Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
      Python
      78853Updated Jan 8, 2026Jan 8, 2026
    • phosphor-debug-collector

      Public
      Collects debug data from the BMC for extraction.
      C++
      12100Updated Jan 8, 2026Jan 8, 2026