-
The Chinese University of Hong Kong
- https://jc-chen1.github.io/
- @Jiacheng_c
- in/jiacheng-chen-6746742b6
Highlights
- Pro
Pinned Loading
-
-
PRIME-RL/Entropy-Mechanism-of-RL
PRIME-RL/Entropy-Mechanism-of-RL PublicThe Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
-
-
MetaEvo/MetaBox
MetaEvo/MetaBox PublicMetaBox: Benchmarking Platform for Meta-Black-Box Optimization
-
THUDM/slime
THUDM/slime Publicslime is an LLM post-training framework for RL Scaling.
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



