Repositories list
12 repositories
RAT
Public
Official code for the NeurIPS 2025 paper "RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling" (https://arxiv.org/abs/2507.04416).

- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok et al. 2025).

EvoTune
Public
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.

python-ml-research-template
Public template
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust

open-instruct
Public

- Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.

flash_attention
Public

tunable-morl-public
Public

mauve
Public

StructuredFFN
Public

deep-NExT-GPT
Public

deep-llava
Public