Pull requests: huggingface/trl
Import TrainerCallback from top-level transformers
#4694
opened Dec 15, 2025 by
qgallouedec
5 tasks
fix: invalidate ZeRO-3 param coordinator trace in add_hooks
#4693
opened Dec 15, 2025 by
roycho96
1 of 5 tasks
Fix KeyError with transformers 5.0.0+ where push_to_hub_token is removed
#4691
opened Dec 14, 2025 by
Manodeepray
3 tasks done
GKDTrainer: Fix return_outputs in Liger kernel path and update tests
#4688
opened Dec 13, 2025 by
roycho96
2 of 5 tasks
Move prepare_model_for_kbit_training, enable_gradient_checkpointing, prepare_peft_model to experimental.utils
#4686
opened Dec 12, 2025 by
qgallouedec
Move get_reward function to experimental.utils
#4683
opened Dec 12, 2025 by
qgallouedec
5 tasks
loss calculation for evaluation without training
#4673
opened Dec 11, 2025 by
SonuDixit
5 tasks
Overwrite model default generation config used by model.generate
#4647
opened Dec 9, 2025 by
albertvillanova
7 of 9 tasks
CPOTrainer - Incorrect handling of different length chosen/rejected p…
#4639
opened Dec 8, 2025 by
davmels
Support async reward functions and parallelize call to reward functions.
#4567
opened Nov 24, 2025 by
pramodith
3 of 5 tasks
Add cross-tokenizer distillation support for GKD and MiniLLM trainers
#4561
opened Nov 22, 2025 by
sambhavnoobcoder
Add PSPO trust region method as alternative to clipping in GRPOTrainer
#4548
opened Nov 19, 2025 by
MCDwyer
2 of 5 tasks