Skip to content
Change the repository type filter

All

    Repositories list

    • slime

      Public
      slime is an LLM post-training framework for RL Scaling.
      Python
      4283.4k11541Updated Jan 19, 2026Jan 19, 2026
    • AgentRL

      Public
      Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
      Python
      1018970Updated Jan 17, 2026Jan 17, 2026
    • CaRR

      Public
      This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".
      Python
      34510Updated Jan 12, 2026Jan 12, 2026
    • MobileRL

      Public
      Python
      65210Updated Dec 23, 2025Dec 23, 2025
    • AgentBench

      Public
      A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
      Python
      2223.1k578Updated Nov 17, 2025Nov 17, 2025
    • ComputerRL

      Public
      Python
      51230Updated Nov 7, 2025Nov 7, 2025
    • PETra

      Public
      Python
      0200Updated Nov 5, 2025Nov 5, 2025
    • AlignBench

      Public
      大模型多维度中文对齐评测基准 (ACL 2024)
      Python
      30421150Updated Oct 25, 2025Oct 25, 2025
    • Python
      11020Updated Oct 15, 2025Oct 15, 2025
    • DeepDive

      Public
      DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
      Python
      1923220Updated Oct 2, 2025Oct 2, 2025
    • TDRM

      Public
      Python
      1900Updated Sep 25, 2025Sep 25, 2025
    • ReST-RL

      Public
      Reinforcing LLM Reasoning through Self-Training and Value-Guided Decoding
      Python
      01300Updated Sep 18, 2025Sep 18, 2025
    • INFTY

      Public
      INFTY Engine: An Optimization Toolkit to Support Continual AI
      Python
      956600Updated Sep 13, 2025Sep 13, 2025
    • DataSciBench

      Public
      DataSciBench: An LLM Agent Benchmark for Data Science
      Python
      54900Updated Sep 1, 2025Sep 1, 2025
    • Python
      2128720Updated Aug 18, 2025Aug 18, 2025
    • SWE-Dev

      Public
      [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
      Python
      05710Updated Jul 21, 2025Jul 21, 2025
    • z-ai-sdk-typescript

      Public
      Typescript SDK for Z.ai - Not yet released.
      TypeScript
      1610Updated Jul 17, 2025Jul 17, 2025
    • BiPro

      Public
      code and data for Paper: BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework(ACL 2025 main)
      Python
      0600Updated Jun 28, 2025Jun 28, 2025
    • [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
      Python
      1841.8k282Updated Jun 24, 2025Jun 24, 2025
    • TreeRL

      Public
      TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
      Python
      88640Updated Jun 16, 2025Jun 16, 2025
    • WebRL

      Public
      Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
      Python
      3149500Updated Jun 6, 2025Jun 6, 2025
    • AndroidGen

      Public
      Python
      11110Updated May 29, 2025May 29, 2025
    • AlignMMBench

      Public
      code, data and model for Paper: AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models (ACL'25 main)
      Python
      2510Updated May 20, 2025May 20, 2025
    • CogKit

      Public
      Finetuning and inference tools for the CogView4 and CogVideoX model series.
      Python
      13112181Updated May 14, 2025May 14, 2025
    • Towards Large Multimodal Models as Visual Foundation Agents
      Python
      9249160Updated Apr 24, 2025Apr 24, 2025
    • Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
      Python
      13500Updated Apr 2, 2025Apr 2, 2025
    • Parameter-Efficient Fine-Tuning for Foundation Models
      310610Updated Mar 31, 2025Mar 31, 2025
    • WebGLM

      Public
      WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
      Python
      1361.6k511Updated Mar 25, 2025Mar 25, 2025
    • WhoIsWho

      Public
      KDD'23 Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit
      Python
      174760Updated Mar 19, 2025Mar 19, 2025
    • Jupyter Notebook
      11850Updated Feb 24, 2025Feb 24, 2025