Skip to content
Change the repository type filter

All

    Repositories list

    • unlearn

      Public
      Python
      0100Updated Jan 28, 2026Jan 28, 2026
    • bergson

      Public
      Mapping out the "memory" of neural nets with data attribution
      Python
      133976Updated Jan 28, 2026Jan 28, 2026
    • A framework for few-shot evaluation of language models.
      Python
      3k11k545169Updated Jan 27, 2026Jan 27, 2026
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      322171510Updated Jan 26, 2026Jan 26, 2026
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      9468554Updated Jan 26, 2026Jan 26, 2026
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      5523846Updated Jan 26, 2026Jan 26, 2026
    • rh-indicators

      Public
      Python
      0000Updated Jan 22, 2026Jan 22, 2026
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      8.8k18620Updated Jan 19, 2026Jan 19, 2026
    • Problems generated by djinn (exploitably verifiable coding problems)
      0000Updated Jan 18, 2026Jan 18, 2026
    • djinn

      Public
      Generating, validating and running exploitable verifiable coding problems
      Python
      0800Updated Jan 16, 2026Jan 16, 2026
    • emergent-misalignment

      Public
      Python
      86301Updated Jan 13, 2026Jan 13, 2026
    • deep-ignorance

      Public
      Python
      31420Updated Jan 7, 2026Jan 7, 2026
    • MIDI tokenizers and pre-processing utils.
      Python
      3611Updated Dec 26, 2025Dec 26, 2025
    • aria

      Public
      Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
      Python
      139300Updated Dec 23, 2025Dec 23, 2025
    • gamescope

      Public
      Can interpretability methods confer an advantage in competitive games?
      Python
      0200Updated Dec 19, 2025Dec 19, 2025
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      96300Updated Dec 16, 2025Dec 16, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      1.1k7.4k6128Updated Dec 10, 2025Dec 10, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      2042.7k163Updated Nov 15, 2025Nov 15, 2025
    • attribute

      Public
      Python
      61501Updated Nov 14, 2025Nov 14, 2025
    • Sparsify transformers with cross-layer transcoders
      Python
      941802Updated Nov 14, 2025Nov 14, 2025
    • Tools for understanding how transformer predictions are built layer-by-layer
      Python
      61200Updated Nov 10, 2025Nov 10, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      13k000Updated Nov 3, 2025Nov 3, 2025
    • Simplified library for mapping out the "memory" of neural nets with data attribution
      Python
      13000Updated Oct 26, 2025Oct 26, 2025
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      7712Updated Oct 14, 2025Oct 14, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      4.7k17101Updated Sep 26, 2025Sep 26, 2025
    • Linear probes with attention weighting
      Python
      1800Updated Aug 2, 2025Aug 2, 2025
    • verifiers

      Public
      Verifiers for LLM Reinforcement Learning
      Python
      479000Updated Jul 31, 2025Jul 31, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      4382983Updated Jul 29, 2025Jul 29, 2025
    • Python
      0100Updated Jul 22, 2025Jul 22, 2025
    • Investigating goal instability in RL
      Python
      0100Updated Jun 2, 2025Jun 2, 2025