generated from fastai/nbdev_template
    
        
        - 
                Notifications
    
You must be signed in to change notification settings  - Fork 2.3k
 
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
      Support casting to fp32 when word embeddings are tied to lm_head
      
    
      
  
        
          #4446
            opened Nov 3, 2025  by
            pramodith
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 of 5 tasks
  
      Moved masked_mean, masked_var and masked_whiten to ppo_trainer.py
      
    
      
  
        
          #4444
            opened Nov 3, 2025  by
            Harras3
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    2 tasks done
  
      [GRPOTrainer bug fix] a little bug in completions with bootstrap
      
    
        
          #4442
            opened Nov 3, 2025  by
            SolarWindRider
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      added 10 papers (+trainer cross-links) for #4407
      
    
      
  
        
          #4441
            opened Nov 3, 2025  by
            SSusantAchary
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 tasks done
  
      docs: add KTO (2402.01306) to Paper Index + link ref to KTOTrainer
      
    
      
  
        
          #4440
            opened Nov 3, 2025  by
            SSusantAchary
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      refactor: Move judges to experimental submodule
      
    
      
  
        
          #4439
            opened Nov 3, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      refactor: Move Mergekit integration to experimental submodule
      
    
      
  
        
          #4438
            opened Nov 3, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      docs: Move Multi-Adapter RL section to PEFT integration
      
    
      
  
        
          #4436
            opened Nov 3, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      docs: Unify model examples to use trl-lib namespace
      
    
      
  
        
          #4431
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      docs: Add PEFT subsection to reducing memory usage guide
      
    
      
  
        
          #4430
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      docs: Expand speeding up training guide with acceleration methods
      
    
      
  
        
          #4428
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      docs: Expand training customization examples
      
    
      
  
        
          #4427
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 tasks done
  
      Replace flash attention2 with kernels-community/flash-attn2
      
    
      
  
        
          #4426
            opened Nov 2, 2025  by
            tamoghnokandar
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 of 5 tasks
  
      docs: Extend CLI basic usage examples to all supported CLIs
      
    
      
  
        
          #4425
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Removed outdated warning about batch contamination
      
    
      
  
        
          #4423
            opened Nov 2, 2025  by
            Harras3
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    2 tasks done
  
      docs: Rewrite PEFT integration guide with comprehensive examples
      
    
      
  
        
          #4421
            opened Nov 2, 2025  by
            behroozazarkhalili
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Add tip for logging evaluation metrics during regular evaluations
      
    
      
  
        
          #4367
            opened Oct 29, 2025  by
            cam1llynha
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [OpenENV] Openenv rollout_func signature proposal
      
    
      
  
        
          #4344
            opened Oct 27, 2025  by
            kashif
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
      Use explicit tiny-Qwen2ForCausalLM-2.5 model_id param in CI tests
      
    
      
  
        
          #4331
            opened Oct 23, 2025  by
            albertvillanova
            
        
        
            
    
  
    Loading…
 
        
        
      
    Previous Next
  
  
  ProTip!
  Find all pull requests that aren't related to any open issues with -linked:issue.