Skip to content
Change the repository type filter

All

    Repositories list

    • DuckTrack

      Public
      Multimodal computer agent data collection program
      Python
      2616190Updated Dec 5, 2025Dec 5, 2025
    • The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores 68% on SWE-be…
      Python
      364000Updated Aug 21, 2025Aug 21, 2025
    • SWE-agent

      Public
      SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive co…
      Python
      2k000Updated Aug 18, 2025Aug 18, 2025
    • Releases from OpenAI Preparedness
      Python
      118000Updated Aug 15, 2025Aug 15, 2025
    • mle-bench

      Public
      MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
      Python
      208000Updated Aug 14, 2025Aug 14, 2025
    • deepthink

      Public
      Python
      0000Updated Jul 27, 2025Jul 27, 2025
    • 🚀 SWE-bench Goes Live!
      Python
      23000Updated Jul 25, 2025Jul 25, 2025
    • Open-source implementation of AlphaEvolve
      Python
      842702Updated Jul 10, 2025Jul 10, 2025
    • JavaScript
      14100Updated Feb 19, 2025Feb 19, 2025
    • prm

      Public
      Python
      312243Updated Jan 17, 2025Jan 17, 2025
    • site

      Public
      JavaScript
      0000Updated Nov 22, 2023Nov 22, 2023
    • arb

      Public
      Advanced Reasoning Benchmark Dataset for LLMs
      TypeScript
      34761Updated Nov 19, 2023Nov 19, 2023
    • chonk

      Public
      Python
      15200Updated Oct 18, 2023Oct 18, 2023
    • videorl

      Public
      Python
      15134Updated Oct 6, 2023Oct 6, 2023
    • 0000Updated Aug 25, 2023Aug 25, 2023
    • Community website
      JavaScript
      1010Updated Aug 14, 2023Aug 14, 2023
    • donut

      Public
      Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
      Python
      553200Updated Nov 22, 2022Nov 22, 2022