Skip to content
Change the repository type filter

All

    Repositories list

    • ldp

      Public
      Framework enabling modular interchange of language agents, environments, and optimizers
      Python
      18121512Updated Jan 28, 2026Jan 28, 2026
    • paper-qa

      Public
      High accuracy RAG for answering questions from scientific documents with citations
      Python
      8178k1292Updated Jan 28, 2026Jan 28, 2026
    • aviary

      Public
      A language agent gym with challenging scientific tasks
      Python
      3123592Updated Jan 27, 2026Jan 27, 2026
    • feathers

      Public archive
      Design system for Future House apps
      TypeScript
      0001Updated Dec 6, 2025Dec 6, 2025
    • Documentation and tutorials for the FutureHouse platform API
      3720Updated Dec 3, 2025Dec 3, 2025
    • robin

      Public
      Robin: A multi-agent system for automating scientific discovery
      Python
      3827631Updated Nov 24, 2025Nov 24, 2025
    • ether0

      Public
      A scientific reasoning model, dataset, and reward functions for chemistry.
      Python
      1815041Updated Oct 26, 2025Oct 26, 2025
    • BixBench

      Public
      Benchmark for LLM-based Agents in Computational Biology
      Python
      136710Updated Oct 6, 2025Oct 6, 2025
    • data-analysis-crow

      Public
      An aviary-based data science agent based on jupyter notebooks
      HTML
      124300Updated Sep 30, 2025Sep 30, 2025
    • LAB-Bench

      Public
      Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology
      Python
      129750Updated Sep 27, 2025Sep 27, 2025
    • Jupyter Notebook
      0100Updated Sep 19, 2025Sep 19, 2025
    • trl

      Public
      FutureHouse fork of trl
      Python
      2.5k105Updated Mar 12, 2025Mar 12, 2025
    • llm-client

      Public archive
      Central LLM client for use by Aviary and PaperQA
      Python
      0200Updated Feb 23, 2025Feb 23, 2025
    • LitQA

      Public archive
      LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer
      Python
      54320Updated Dec 18, 2024Dec 18, 2024
    • WikiCrow

      Public
      23800Updated Oct 24, 2024Oct 24, 2024
    • SWE-bench

      Public
      Fork of upstream
      Python
      745000Updated Jul 24, 2024Jul 24, 2024