This project implements a Q-Learning agent trained on the FrozenLake-v1 environment as part of the Hugging Face Deep Reinforcement Learning course.
- FrozenLake-v1
- 4x4 grid
- No slippery (deterministic)
Tabular Q-Learning with Ξ΅-greedy exploration.
Q-update rule:
Q(s,a) = Q(s,a) + Ξ± (r + Ξ³ max Q(s',a') - Q(s,a))
The trained agent consistently reaches the goal in the deterministic FrozenLake environment.
https://huggingface.co/BhushanGatty/q-FrozenLake-v1-4x4-noSlippery
Bhushan Gatty
Robotics Engineering Student
Deep Reinforcement Learning Enthusiast
