On GAE calculation math #593
michael-lutz
started this conversation in
General
Replies: 2 comments
-
the only logical explaination i could come up with was :
correct me if I'm wrong? |
Beta Was this translation helpful? Give feedback.
0 replies
-
The extra 0.5 factor indeed just scales the value loss. I've futzed with a I'm not really sure about the deviation in the GAE implementation, but using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In
losses.py
, I noticed that the code includes the following step before returning value targets and advantages:From what I understand,
compute_vs_minus_v_xs
should return the standard GAE result. Why do we perform an additional TD computation at the end?Second, I was hoping to ask why the value loss includes an extra
0.5
term:Both these decisions seem non-standard. Did you find they improved performance empirically?
Beta Was this translation helpful? Give feedback.
All reactions