You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[skyrl-train] Fix num_training_steps for workers being set incorrectly (#873)
`num_training_steps` was being set to the number of training batch steps
rather than the number of optimizer (mini-batch) steps, causing learning
rate decay to progress too quickly if using a non-constant learning rate
scheduler.
renames to `num_training_batches` for clarity, since each training batch
can contain several optimizer steps.
Closes#872
0 commit comments