Pinned
-
KempnerInstitute/AgentsOpenRLHF (Public, forked from OpenRLHF/OpenRLHF)
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Python 1
-
multiturn-rl-agent (Public)
Multi-turn RL agents with simulation-based planning, compatible with OpenRLHF
Python
