    Repositories list

    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      2862k1.1k179 Updated Nov 6, 2025
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4293.8k40631 Updated Nov 6, 2025
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1018922013 Updated Nov 6, 2025
    • mig-parted

      Public
      MIG Partition Editor for NVIDIA GPUs
      Go
      502232215 Updated Nov 6, 2025
    • TensorRT-Model-Optimizer

      Public
      A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
      Python
      1901.5k6044 Updated Nov 6, 2025
    • tilus

      Public
      Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
      Python
      839571 Updated Nov 6, 2025
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      29784341981 Updated Nov 6, 2025
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      C++
      1.8k12k736424 Updated Nov 6, 2025
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4121.5k5331 Updated Nov 6, 2025
    • linux

      Public
      OpenBMC Linux kernel source tree
      C
      58k700 Updated Nov 6, 2025
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7533.5k9139 Updated Nov 6, 2025
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4062.4k38160 Updated Nov 6, 2025
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      1464245 Updated Nov 6, 2025
    • earth2studio

      Public
      Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
      Python
      742851312 Updated Nov 6, 2025
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.2k14k319178 Updated Nov 6, 2025
    • pldm

      Public
      C++
      57402 Updated Nov 6, 2025
    • k8s-driver-manager

      Public
      The NVIDIA Driver Manager is a Kubernetes component that assists in seamless upgrades of the NVIDIA driver on each node of the cluster.
      Go
      174144 Updated Nov 6, 2025
    • spark-rapids-ml

      Public
      Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
      Jupyter Notebook
      3184311 Updated Nov 6, 2025
    • cudaqx

      Public
      Accelerated libraries for quantum-classical computing built on CUDA-Q.
      C++
      34642313 Updated Nov 6, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      68360206192 Updated Nov 6, 2025
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      935625893 Updated Nov 6, 2025
    • barney

      Public
      A Scalable (and Optionally, Data-Parallel) ANARI Multi-GPU Path Tracer
      C++
      3700 Updated Nov 6, 2025
    • cudnn-frontend

      Public
      cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it
      C++
      135638353 Updated Nov 6, 2025
    • OWL

      Public
      The OptiX Wrappers Library
      C++
      3601 Updated Nov 6, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2722.8k9837 Updated Nov 6, 2025
    • nvidia-resiliency-ext

      Public
      NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves effective training time by minimizing downtime due to failures and interruptions.
      Python
      34229115 Updated Nov 6, 2025
    • spark-rapids-jni

      Public
      RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7451776 Updated Nov 6, 2025
    • nvloom

      Public
      nvloom is a set of tools designed to scalably test MNNVL fabrics.
      C++
      52910 Updated Nov 6, 2025
    • physicsnemo

      Public
      Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
      Python
      4732k3832 Updated Nov 6, 2025
    • k8s-nim-operator

      Public
      An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
      Go
      33131731 Updated Nov 6, 2025