A3C-tensorflow

Implementation of A3C using TensorFlow v0.9 (it is easy to modify and run on later versions).

Prerequisites

Clone the Arcade Learning Environment (ALE) with multi-thread support from here, then build and install it. The modifications to ALE are necessary to avoid multithreading problems.

Usage

$ python main.py

There are several options to change learning parameters and behaviors.

rom: Atari rom file to play. Defaults to breakout.bin.
threads_num: Number of learner threads to run in parallel. Defaults to 8.
local_t_max: Number of steps to look ahead. Defaults to 5.
global_t_max: Number of iterations to train. Defaults to 8e7 (80 million). The learning rate decreases in proportion to this value.
use_gpu: Whether to use the GPU instead of the CPU. Defaults to True. Set it to False to use the CPU.
shrink_image: Whether to only shrink the state image instead of trimming and then shrinking it. Defaults to False.
life_lost_as_end: Whether to treat a lost life in the game as a terminal state. Defaults to True.
evaluate: Evaluate trained network. Defaults to False.
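
To make local_t_max and global_t_max more concrete, here is a minimal sketch of how these two values are typically used in A3C. It is not taken from this repository: the function names, the discount factor gamma, and the linear decay to zero are assumptions made for illustration.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns for one rollout of at most local_t_max steps.

    rewards: rewards collected during the rollout, oldest first.
    bootstrap_value: value estimate of the state after the last step
        (0.0 if the rollout ended in a terminal state).
    gamma: discount factor (assumed value, not taken from the repository).
    """
    returns = []
    running = bootstrap_value
    # Walk backwards so each return reuses the one that follows it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def annealed_learning_rate(initial_rate, global_t, global_t_max):
    """Linearly decay the learning rate to zero at global_t_max.

    Linear decay is an assumption; the README only states that the
    learning rate decreases in proportion to global_t_max.
    """
    remaining = max(global_t_max - global_t, 0)
    return initial_rate * remaining / global_t_max
```

With local_t_max set to 5, each learner thread would pass at most five rewards to a function like n_step_returns before computing gradients.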

Options can be passed as follows:

$ python main.py --rom="pong.bin" --threads_num=4
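
For readers who want to see how flags in this `--name=value` style can be parsed, the following is a sketch using Python's standard argparse module. It is an illustration only and not how main.py is implemented; the helper names str_to_bool and build_parser are hypothetical, and only the option names and defaults come from the list above.

```python
import argparse


def str_to_bool(text):
    """Parse values such as 'True' or 'False', as in --evaluate=True."""
    lowered = text.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError("expected True or False, got %r" % text)


def build_parser():
    """Build a parser mirroring the documented options and defaults."""
    parser = argparse.ArgumentParser(description="A3C options (illustrative)")
    parser.add_argument("--rom", default="breakout.bin")
    parser.add_argument("--threads_num", type=int, default=8)
    parser.add_argument("--local_t_max", type=int, default=5)
    # Accept scientific notation such as 8e7 and store it as an integer.
    parser.add_argument("--global_t_max", type=lambda s: int(float(s)),
                        default=int(8e7))
    parser.add_argument("--use_gpu", type=str_to_bool, default=True)
    parser.add_argument("--shrink_image", type=str_to_bool, default=False)
    parser.add_argument("--life_lost_as_end", type=str_to_bool, default=True)
    parser.add_argument("--evaluate", type=str_to_bool, default=False)
    return parser


# Parse the same arguments as the command above, passed as an explicit list.
args = build_parser().parse_args(["--rom=pong.bin", "--threads_num=4"])
```

Options that are not given on the command line keep the defaults listed above.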

Results

A3C-FF breakout

Result after training for 80 million steps with 8 threads. Training took about 40 hours on an 8-core Ryzen 1800X.

A3C-FF pong

Result after training for 80 million steps with 8 threads. Training took about 34 hours on an 8-core Ryzen 1800X.

To load and watch the trained network result

$ python main.py --evaluate=True --checkpoint_dir=trained_results/breakout/ --trained_file=network_parameters-80002500

License

MIT
