JC-Chen1

Jiacheng Chen JC-Chen1

Achievements

PRIME-RL/P1 PRIME-RL/P1 Public

P1: Mastering Physics Olympiads with Reinforcement Learning

73 4
PRIME-RL/Entropy-Mechanism-of-RL PRIME-RL/Entropy-Mechanism-of-RL Public

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 420 15
MetaEvo/Symbol MetaEvo/Symbol Public

Python implementation of SYMBOL

Python 17 4
MetaEvo/MetaBox MetaEvo/MetaBox Public

MetaBox: Benchmarking Platform for Meta-Black-Box Optimization

Python 157 15
THUDM/slime THUDM/slime Public

slime is an LLM post-training framework for RL Scaling.

Python 3.7k 503
verl-project/verl verl-project/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19.1k 3.2k