Skip to content

Conversation

@skinreich11
Copy link

encoder logic

PhilipGe and others added 3 commits March 17, 2025 21:49
… for DeepQ model training and utilization. Calls to API can be injected into code of agent traversing a state space to serve Q values. API serves Q values based on a calling agent's state, action, resulting state if action is taken, and resulting reward if action is taken: (s,a,s',r) by passing them through a neural net. Nueral net is trained through the 'explore' function that updates weights based on a specific instance of (s,a,s',r).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants