GitHub - nikxtaco/RL-CNN-Maze

Setup

Install required packages with:

pip install gymnasium matplotlib opencv-python pygame stable_baselines3

To set up the maze environment, run:

python setup.py install

Environment Details

There are two agents. One tries to reach the fixed end of the maze (green), while the other tries to reach a randomly spawning object (red).
Each agent gets its own environment, which steps through a series of mazes that are identical across the two agents.
The maze is reset when either (i) the episode reaches the maximum length of 500 timesteps or (ii) the agent reaches its goal.
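
The reset rule above can be sketched in a few lines. This is only an illustration of the two stated conditions, not the repository's actual code: `should_reset`, `run_episode`, and `step_fn` are hypothetical names, and the check is assumed to happen once per timestep.

```python
MAX_TIMESTEPS = 500  # episode cap stated above


def should_reset(timestep: int, reached_goal: bool,
                 max_timesteps: int = MAX_TIMESTEPS) -> bool:
    """Return True when either reset condition holds."""
    return reached_goal or timestep >= max_timesteps


def run_episode(step_fn, max_timesteps: int = MAX_TIMESTEPS) -> int:
    """Run one episode and return the number of timesteps it took.

    step_fn is a hypothetical callable that takes the 1-based timestep
    and returns True once the agent has reached its goal.
    """
    timestep = 0
    while True:
        timestep += 1
        reached_goal = step_fn(timestep)
        if should_reset(timestep, reached_goal, max_timesteps):
            return timestep
```

For example, `run_episode(lambda t: t == 7)` ends after 7 timesteps, while an agent that never reaches its goal is cut off at 500.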

Stats Recorded

In log_agent#.csv: each agent has its own log file, which records the episode number, the environment number, and the number of timesteps the agent took to reach the goal state (counted from the last maze reset).

In performance_agent#.csv: each agent has its own performance file, which records the episode number and the following metrics for that episode: cumulative reward, policy (pi) loss, value loss, and entropy loss.
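
As a rough sketch of how these files might be inspected outside the plotting scripts, the helper below averages one numeric column of a CSV file. It assumes the file has a header row; the README does not document the actual column names, so any name passed in (such as `cumulative_reward` below) is a guess to be checked against the real file.

```python
import csv
from statistics import mean
from typing import TextIO


def column_mean(csv_file: TextIO, column: str) -> float:
    """Average the values of one numeric column in a CSV with a header row."""
    reader = csv.DictReader(csv_file)
    # Skip rows where the column is missing or empty.
    values = [float(row[column]) for row in reader
              if row.get(column) not in (None, "")]
    if not values:
        raise ValueError(f"no values found for column {column!r}")
    return mean(values)
```

Usage might look like this, with the column name adjusted to match the file:

```python
with open("performance_agent0.csv", newline="") as f:
    print(column_mean(f, "cumulative_reward"))  # hypothetical column name
```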

Current Functionality Summary

Run train_agent#.py to train an agent (# is the agent index, 0 or 1).
Models are saved into saved_models/agent#/{timestep}.pt
Log data is saved into log_agent#.csv
Performance data is saved into performance_agent#.csv
Run plot_agent#.py to visualize the mean reward and loss graphs.
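
To make the naming pattern concrete, the sketch below expands the `#` and `{timestep}` placeholders for a given agent. `agent_artifacts` is a hypothetical helper, not part of the repository, and it assumes `#` stands for the agent index and `{timestep}` for an integer timestep count.

```python
from pathlib import PurePosixPath


def agent_artifacts(agent_id: int, timestep: int) -> dict:
    """Expand the file-name patterns listed above for one agent."""
    return {
        "train_script": f"train_agent{agent_id}.py",
        "plot_script": f"plot_agent{agent_id}.py",
        "log": f"log_agent{agent_id}.csv",
        "performance": f"performance_agent{agent_id}.csv",
        # saved_models/agent#/{timestep}.pt
        "model": str(PurePosixPath("saved_models")
                     / f"agent{agent_id}" / f"{timestep}.pt"),
    }
```

For instance, `agent_artifacts(0, 1000)["model"]` gives `saved_models/agent0/1000.pt`, where 1000 is an arbitrary illustrative timestep.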

