Return and value funtions definitions for RL (pages 680-683) #47
labdmitriy
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi Sebastian,
There is a definition of the return on the page 680, where we have multiplier gamma^k in the sum for R(t+k+1).
But for value and action-value functions we have gamma^(k+1) multiplier for the same R(t+k+1) on the pages 682-683.
Why do we replace the term with gamma^(k+1) multiplier instead of gamma^k in the latter definitions?
Thank you.
Beta Was this translation helpful? Give feedback.
All reactions