Skip to content
Change the repository type filter

All

    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      16k17037Updated Apr 17, 2026Apr 17, 2026
    • Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation results — from leaderboar…
      Python
      MIT License
      29004Updated Apr 16, 2026Apr 16, 2026
    • Fast and memory-efficient exact attention
      C++
      BSD 3-Clause "New" or "Revised" License
      2.6k000Updated Apr 16, 2026Apr 16, 2026
    • A Python library for guardrail models evaluation with vLLM support.
      Python
      European Union Public License 1.2
      100013Updated Apr 15, 2026Apr 15, 2026
    • 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference an…
      Python
      Apache License 2.0
      33k000Updated Apr 15, 2026Apr 15, 2026
    • Go
      Apache License 2.0
      0004Updated Apr 14, 2026Apr 14, 2026
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      3.2k501Updated Apr 14, 2026Apr 14, 2026
    • 1213Updated Apr 14, 2026Apr 14, 2026
    • Tool to scrape benchmarks used most commonly in recent popular open source models
      Python
      MIT License
      1000Updated Apr 11, 2026Apr 11, 2026
    • Beam search scheduler plugin for vLLM v1 with CoW block table forking
      Python
      0000Updated Apr 9, 2026Apr 9, 2026
    • SWE-bench

      Public
      SWE-bench: Can Language Models Resolve Real-world Github Issues?
      Python
      MIT License
      826000Updated Apr 8, 2026Apr 8, 2026
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      5.4k103Updated Apr 8, 2026Apr 8, 2026
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      1.3k005Updated Apr 8, 2026Apr 8, 2026
    • research

      Public
      Repository to enable research flows
      Python
      0303Updated Mar 26, 2026Mar 26, 2026
    • lighteval

      Public
      Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
      Python
      MIT License
      452001Updated Mar 23, 2026Mar 23, 2026
    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      Other
      5610012Updated Mar 12, 2026Mar 12, 2026
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      1.2k100Updated Mar 11, 2026Mar 11, 2026
    • The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-b…
      Python
      MIT License
      535000Updated Mar 10, 2026Mar 10, 2026
    • vllm-fork

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      16k001Updated Mar 5, 2026Mar 5, 2026
    • TPU inference for vLLM, with unified JAX and PyTorch support.
      Python
      Apache License 2.0
      163000Updated Mar 5, 2026Mar 5, 2026
    • A framework for efficient model inference with omni-modality models
      Python
      Apache License 2.0
      776201Updated Mar 3, 2026Mar 3, 2026
    • Arena-Hard-Auto: An automatic LLM benchmark.
      Python
      Apache License 2.0
      150003Updated Mar 3, 2026Mar 3, 2026
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      28k106Updated Feb 11, 2026Feb 11, 2026
    • Neural Magic GHA
      Python
      Apache License 2.0
      0004Updated Jan 22, 2026Jan 22, 2026
    • nm-vllm

      Public archive
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Other
      16k26600Updated Dec 4, 2025Dec 4, 2025
    • Python
      1001Updated Nov 13, 2025Nov 13, 2025
    • Open Data Hub operator to manage ODH component integrations
      Go
      Apache License 2.0
      252000Updated Nov 12, 2025Nov 12, 2025
    • DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      1.2k000Updated Sep 26, 2025Sep 26, 2025
    • Common mixins, registries, and utilities with native support for Pydantic used across popular repos such as GuideLLM and Speculators
      Apache License 2.0
      0000Updated Sep 17, 2025Sep 17, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      33k200Updated Sep 12, 2025Sep 12, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.