Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.2k 683

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.4k 162

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.6k 265

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 16.2k 1.2k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 927 87

Repositories

Showing 10 of 535 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 546 Apache-2.0 98 9 42 Updated Dec 14, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 58 Apache-2.0 11 1 31 Updated Dec 14, 2025
  • autodiscovery Public

    Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"

    allenai/autodiscovery’s past year of commit activity
    Python 111 16 0 0 Updated Dec 13, 2025
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 113 18 2 13 Updated Dec 13, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,430 Apache-2.0 474 12 (1 issue needs help) 45 Updated Dec 13, 2025
  • datamap-rs Public

    Data mapping framework for rust stuff

    allenai/datamap-rs’s past year of commit activity
    Rust 36 Apache-2.0 4 0 3 Updated Dec 13, 2025
  • allenai/rslearn_projects’s past year of commit activity
    Python 16 Apache-2.0 6 15 8 Updated Dec 12, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 59 Apache-2.0 10 20 7 Updated Dec 12, 2025
  • beaker-gantry Public

    Gantry is a CLI that streamlines running experiments in Beaker

    allenai/beaker-gantry’s past year of commit activity
    Python 28 Apache-2.0 7 2 2 Updated Dec 13, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 16,194 Apache-2.0 1,248 32 14 Updated Dec 13, 2025