    Repositories list

    • OpenHands

      Public
      🙌 OpenHands: Code Less, Make More
      Python
      7.9k001 Updated Nov 12, 2025
    • Optimizing inference proxy for LLMs
      Python
      241301 Updated Nov 12, 2025
    • reap

      Public
      REAP: Router-weighted Expert Activation Pruning for SMoE compression
      Python
      139920 Updated Nov 9, 2025
    • Agent computer interface for AI software engineer.
      Python
      57000 Updated Oct 7, 2025
    • [ACL 2025] Graph-guided agentic framework for code localization https://arxiv.org/abs/2503.09089
      Python
      77000 Updated Oct 3, 2025
    • gepa

      Public
      Optimize prompts, code, and more with AI-powered Reflective Text Evolution
      Jupyter Notebook
      1103 Updated Sep 23, 2025
    • Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
      Python
      7.6k000 Updated Aug 13, 2025
    • Python
      4000 Updated Aug 6, 2025
    • [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs
      Python
      3000 Updated Jul 22, 2025
    • HTML
      0000 Updated May 6, 2025
    • Collaboration between Cerebras and DOE Tri-Labs (LLNL, LANL, and SNL)
      TeX
      0000 Updated Apr 29, 2025
    • fork of short transformers with better dataset preparation and options
      Python
      0000 Updated Dec 15, 2024
    • nanoGNS

      Public
      Minimal reference implementations for per-example gradient norm methods for computing GNS
      Jupyter Notebook
      1800 Updated Nov 15, 2024
    • LongBench

      Public
      LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
      Python
      107001 Updated Sep 3, 2024
    • Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency
      Python
      02510 Updated Jul 31, 2024
    • A framework for few-shot evaluation of language models.
      Python
      2.8k000 Updated Jul 12, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      31k000 Updated Apr 26, 2024
    • Cerebras's internal version of EleutherAI's DeeperSpeed library with bug fixes and patches made for our own version of GPT-Neox at https://github.com/CerebrasResearch/gpt-neox; DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      4.6k000 Updated Oct 11, 2023
    • RevBiFPN

      Public
      RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
      Python
      11500 Updated Oct 18, 2022