Fix GRPO to conform with TRL: Fix loss, make tests accurate, correct metrics computation #825
Annotations
1 error
|
checkstyle
Process completed with exit code 2.
|