Fix GRPO to conform with TRL: Fix loss, make tests accurate, correct metrics computation #827
Annotations
1 error
checkstyle
Process completed with exit code 2.