Skip to content

Fix GRPO to conform with TRL: Fix loss, make tests accurate, correct metrics computation #849

Fix GRPO to conform with TRL: Fix loss, make tests accurate, correct metrics computation

Fix GRPO to conform with TRL: Fix loss, make tests accurate, correct metrics computation #849

Annotations

2 errors

The logs for this run have expired and are no longer available.