This project implements a Q-Learning agent trained on the Taxi-v3 environment as part of the Hugging Face Deep Reinforcement Learning Course (Unit 2).
- Taxi-v3
- 500 states
- 6 actions
Tabular Q-Learning with epsilon-greedy exploration.
Mean reward > 4 (assignment requirement satisfied).
https://huggingface.co/BhushanGatty/q-Taxi-v3
Bhushan Gatty
Robotics Engineering Student
Deep Reinforcement Learning Enthusiast
