Solving Pong with Reinforcement Learning (Policy Gradients)

Install

python3.11 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Train

python train.py --epochs <NUM_EPOCHS>

After 1,000 epochs of training, the model can outplay the opponent.
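
For readers new to policy gradients: the method weights each action's log-probability by the discounted return that followed it. Below is a minimal sketch of that return computation. It is not taken from train.py; the function name `discounted_returns` and the default `gamma=0.99` are assumptions for illustration only.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    `rewards` is a list of per-step rewards; `gamma` is the discount factor
    (0.99 is an assumed default, not a value confirmed by this repo).
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

In a typical policy-gradient loop these returns would then scale the log-probabilities of the actions taken, so that actions followed by a won point are reinforced and actions followed by a lost point are discouraged.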

Run

python run.py

To use the pretrained weights, rename model_1000.pt to model.pt, then run python run.py.
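
If you would rather keep the original checkpoint file, copying works as well as renaming. The helper below is a small sketch of that step. The function name `install_pretrained` is hypothetical, and it assumes run.py reads model.pt from the current working directory.

```python
import shutil
from pathlib import Path


def install_pretrained(src="model_1000.pt", dst="model.pt"):
    """Copy a checkpoint to the file name that run.py is assumed to load."""
    src_path, dst_path = Path(src), Path(dst)
    if not src_path.is_file():
        raise FileNotFoundError(f"checkpoint not found: {src_path}")
    # Copy instead of renaming so the original checkpoint is left untouched.
    shutil.copyfile(src_path, dst_path)
    return dst_path
```

Calling `install_pretrained()` from the repository root and then running python run.py should have the same effect as the manual rename.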