Replies: 2 comments
-
Hi @ZaberKo, this question also came up in #593. We've considered adding a v_loss_coef hyperparameter, but it never made much of a difference. If you find that it helps your policies significantly, please send us a PR or open an issue.
-
After some tests, I agree that setting v_loss_coef to 0.5 or 0.25 makes no noticeable difference. Nevertheless, the extra 0.5 is strange and may cause confusion.
-
In brax's PPO implementation, why is the factor 0.5 applied twice in v_loss? In my experience, the coefficient should be 0.5 or 1.0, not 0.25.
brax/brax/training/agents/ppo/losses.py
Line 182 in 19bbd9b
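For anyone following along, here is a minimal sketch of the arithmetic in question, written in plain Python rather than JAX so it runs without dependencies. It assumes the referenced line multiplies the mean squared value error by 0.5 twice, as the question describes. The function `value_loss` and its `v_loss_coef` argument are hypothetical names used only for illustration and are not part of brax's API.

```python
def value_loss(values, targets, v_loss_coef=0.25):
    """Mean squared value error scaled by a hypothetical v_loss_coef."""
    squared_errors = [(t - v) ** 2 for v, t in zip(values, targets)]
    mse = sum(squared_errors) / len(squared_errors)
    return v_loss_coef * mse


# Toy predictions and targets (made-up numbers, for illustration only).
values = [0.0, 1.0, 2.0]
targets = [1.0, 1.0, 4.0]

# Assumed form of the line under discussion: MSE * 0.5 * 0.5.
mse = sum((t - v) ** 2 for v, t in zip(values, targets)) / len(values)
doubled_half = mse * 0.5 * 0.5

# The two 0.5 factors collapse into a single coefficient of 0.25.
same_as_quarter = abs(value_loss(values, targets) - doubled_half) < 1e-12

# A coefficient of 0.5 gives exactly twice the loss of 0.25.
ratio = value_loss(values, targets, v_loss_coef=0.5) / value_loss(values, targets)
```

Changing the coefficient only rescales the value term relative to the other terms of the PPO loss, which may be why 0.5 and 0.25 behaved alike in the tests reported in the replies.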