Pinned Loading
-
self_ref_feedback
self_ref_feedback PublicCode for Improving Large Language Model Alignment from Self-Reference Model Feedback
Python 7
-
slime
slime PublicForked from THUDM/slime
slime is a LLM post-training framework aiming at scaling RL.
Python
-
sgl-project/sglang
sgl-project/sglang PublicSGLang is a high-performance serving framework for large language models and multimodal models.
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

