Skip to content

Devesh-Maheshwari/verifiable-rl-coder

Repository files navigation

verifiable-rl-coder

RL post-training for small open coding models using verifiable rewards from actual test execution. Proposer / verifier / reviewer pipeline, sandboxed code runner, GRPO training loop.

Status: planning. See docs/PLAN.md.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors