-
Notifications
You must be signed in to change notification settings - Fork 252
Pull requests: opendilab/awesome-RLHF
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add paper: LLMs Meet Finance (SFT+DPO+RL for financial NLP)
#86
opened Mar 28, 2026 by
WhymustIhaveaname
Loading…
add(potato): add Potato annotation tool for human evaluation
#84
opened Mar 14, 2026 by
davidjurgens
Loading…
Request for adding paper: Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance
#82
opened Jan 16, 2026 by
BIRlz
Loading…
ProTip!
Exclude everything labeled
bug with -label:bug.