gokulmk-12 / LLM-Reasoning Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Finetuning LLM to reason using RL from Human Feedback

Apache-2.0 license

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
grpo		grpo
LICENSE		LICENSE
README.md		README.md
models.py		models.py

Repository files navigation

LLM-Reasoning-

Finetuning LLM to reason using RL from Human Feedback

About

Finetuning LLM to reason using RL from Human Feedback

Apache-2.0 license

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%