Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      03510Updated Jan 16, 2026Jan 16, 2026
    • LoRe

      Public
      When Reasoning Meets Its Laws
      Python
      33410Updated Jan 2, 2026Jan 2, 2026
    • [ICLR 2026] Official implementation for "On the Fragility of Benchmark Contamination Detection in Reasoning Models"
      Jupyter Notebook
      01000Updated Oct 9, 2025Oct 9, 2025
    • Python
      0100Updated Oct 7, 2025Oct 7, 2025
    • DecepChain

      Public
      Official implementation for "DecepChain: Inducing Deceptive Reasoning in Large Language Models"
      Python
      0400Updated Oct 5, 2025Oct 5, 2025
    • ASTRA

      Public
      [CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks"
      Python
      25010Updated Jul 5, 2025Jul 5, 2025
    • AlphaOne

      Public
      [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
      Python
      58900Updated Jun 10, 2025Jun 10, 2025
    • SVIP

      Public
      SVIP: Towards Verifiable Inference of Open-Source Large Language Models
      Python
      11300Updated Jun 3, 2025Jun 3, 2025
    • [ICML 2025] Official implementation for "The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Conta…
      Python
      01400Updated May 23, 2025May 23, 2025