Pull requests: huggingface/trl

Pull requests list

Import TrainerCallback from top-level transformers
#4694 opened Dec 15, 2025 by qgallouedec
5 tasks
fix: invalidate ZeRO-3 param coordinator trace in add_hooks
#4693 opened Dec 15, 2025 by roycho96
1 of 5 tasks
Fix KeyError with transformers 5.0.0+ where push_to_hub_token is removed
#4691 opened Dec 14, 2025 by Manodeepray
3 tasks done
Fix typos
#4690 opened Dec 14, 2025 by qgallouedec
feat: DeepSeek V3.2 Off-policy sequence masking
#4689 opened Dec 13, 2025 by casinca Draft
5 tasks
GKDTrainer: Fix return_outputs in Liger kernel path and update tests
#4688 opened Dec 13, 2025 by roycho96
2 of 5 tasks
Align stable trainers
#4687 opened Dec 12, 2025 by qgallouedec
5 tasks
Align GRPO and RLOO initialization
#4685 opened Dec 12, 2025 by qgallouedec
Align import utils with transformers
#4684 opened Dec 12, 2025 by qgallouedec
Move get_reward function to experimental.utils
#4683 opened Dec 12, 2025 by qgallouedec
5 tasks
loss calculation for evaluation without training
#4673 opened Dec 11, 2025 by SonuDixit
5 tasks
Update import structure
#4665 opened Dec 11, 2025 by qgallouedec Draft
Add GRPO QLoRA free notebook
#4660 opened Dec 10, 2025 by sergiopaniego Draft
5 tasks
[WIP] GRPO-inspired Online DPO refactor
#4659 opened Dec 10, 2025 by d-tiapkin Draft
2 of 7 tasks
feature: Add RTPO Trainer
#4652 opened Dec 9, 2025 by SolarWindRider
6 tasks done
Set version to packaged one in notebooks
#4648 opened Dec 9, 2025 by sergiopaniego
5 tasks
Preserve truncated tokens in BFD packing
#4632 opened Dec 5, 2025 by qgallouedec
Update docs landing with latest details
#4624 opened Dec 4, 2025 by sergiopaniego
6 tasks
Add PSPO trust region method as alternative to clipping in GRPOTrainer
#4548 opened Nov 19, 2025 by MCDwyer
2 of 5 tasks