Welcome to my Reinforcement Learning (RL) Playground! This repository is a personal project where I implement various RL algorithms from scratch, purely for fun and learning. The goal is to build a solid intuition behind RL concepts by implementing them step by step, without relying on high-level libraries.
- β Value Iteration
- β Policy Iteration
- β Monte Carlo Methods
- β Temporal Difference (TD) Learning
- β Q-Learning
- β SARSA
- β Deep Q-Network (DQN)
- β REINFORCE (Monte Carlo Policy Gradient)
- π Proximal Policy Optimization (PPO) (planned)
- π Deep Determinstic Policy Gradient (DDPG) (planned)
- π Twin-Delayed Deep Determinstic Policy Gradient (TD3) (planned)
- π Soft Actor-Critic (SAC) (planned)