    Repositories list

    • MARTI

      Public
      A Framework for LLM-based Multi-Agent Reinforced Training and Inference
      Python
      MIT License
      4643830 Updated Feb 19, 2026
    • Awesome-Memory-for-Agents

      Public
      A Collection of Papers about Memory for Language Agents
      MIT License
      1835311 Updated Jan 21, 2026
    • Awesome-RL-for-LRMs

      Public
      A Survey of Reinforcement Learning for Large Reasoning Models
      TeX
      MIT License
      1282.3k30 Updated Nov 9, 2025
    • AdsQA

      Public
      [ICCV 2025] AdsQA: Towards Advertisement Video Understanding. arXiv: https://arxiv.org/abs/2509.08621
      Python
      23340 Updated Oct 30, 2025
    • Decomposed-Forward-Pass

      Public
      [NeurIPS 2025] DePass: Unified Feature Attributing by Simple Decomposed Forward Pass
      Python
      1810 Updated Oct 14, 2025
    • Unify-Post-Training

      Public
      Towards a Unified View of Large Language Model Post-Training
      Python
      MIT License
      920470 Updated Sep 8, 2025
    • SSRL

      Public
      SSRL: Self-Search Reinforcement Learning
      Python
      Apache License 2.0
      1420700 Updated Aug 20, 2025
    • MedXpertQA

      Public
      [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
      Python
      MIT License
      914131 Updated Jul 17, 2025
    • 0300 Updated Jul 2, 2025
    • Fourier-Position-Embedding

      Public
      [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
      Python
      711030 Updated Jun 2, 2025
    • TPAMI 2025 Survey Paper
      Python
      12500 Updated Mar 31, 2025
    • FS-GEN

      Public
      Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
      Python
      41310 Updated Nov 19, 2024
    • LPA

      Public
      [EMNLP 2024, Main Conference] Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
      Python
      01000 Updated Nov 5, 2024
    • [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine
      Python
      49410 Updated Sep 26, 2024
    • CRaSh

      Public
      [EMNLP 2023, Main Conference] CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
      Python
      0400 Updated Aug 26, 2024
    • CoGenesis

      Public
      [ACL 2024, Main Conference] CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.
      Python
      11220 Updated Aug 7, 2024
    • [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
      Python
      03000 Updated Aug 2, 2024
    • [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
      Python
      01500 Updated Jul 15, 2024
    • .github

      Public
      0000 Updated Apr 27, 2024
    • SoRA

      Public
      [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
      Python
      1184110 Updated Mar 5, 2024