Project 2: Reacher - Continuous control

Submission of Reached problem using DDPG

The Environment

In this problem, the environment is a robotic arm with two pivots, each of which have 2 motors. The aim is for the agent to maintain the end of the arm (dark blue sphere) inside the target (translucent sphere).

State

The state space is 33 variables corresponding to position, rotation, velocity, and angular velocities for each robotic arm.

Actions

Each robotic arm has 4 motors which can be given a torque value of between -1 and 1, hence the environment must be given a vector of length 4.

Rewards

The agent receives a +0.1 reward for each timestep that the agent remains in contact with the target sphere.

Solution

The aim of the environment is to keep the end of the robotic arm inside a target sphere so that the average score across all 20 arms for 100 consecutive episodes is above 30.

Getting Started

The dependencies that are required can be installed using the following:

First, install conda: https://www.anaconda.com/distribution/#download-section

Next, create a new conda environment and activate

conda create -n Reacher python=3.6.3 anaconda

conda activate Reacher

Next install pytorch using: conda install pytorch=0.4.0 cuda80 -c pytorch

And ml-agents ugin: pip install mlagents==0.4.0

Finally, the environment and scripts are downloaded from

git clone https://github.com/SamJCKnox/P2_Reacher_Submission.git

Instructions

The DDPGMultiAgent script is the header which calls all scripts required to run. Run all sections to train the agent. Outputs will show how the agent is performing. The last section shows the agent evaluation.

The networks trained in the current outputs of the Jupyter Notebook are in BenchmarkNetworks, copy these into the root directory to view in the evaluation section.

Report.md shows the architecture of the networks with the hyperparamteres.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.ipynb_checkpoints		.ipynb_checkpoints
BenchmarkNetwork		BenchmarkNetwork
Reacher_Windows_x86_64		Reacher_Windows_x86_64
__pycache__		__pycache__
.gitattributes		.gitattributes
ActorCriticDrawing.png		ActorCriticDrawing.png
DDPGMultiAgent.ipynb		DDPGMultiAgent.ipynb
README.md		README.md
Report.md		Report.md
ScoresDDPG.png		ScoresDDPG.png
checkpoint_actor.pth		checkpoint_actor.pth
checkpoint_critic.pth		checkpoint_critic.pth
ddpg_agent.py		ddpg_agent.py
model.py		model.py
runner.py		runner.py
unity-environment.log		unity-environment.log

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project 2: Reacher - Continuous control

The Environment

State

Actions

Rewards

Solution

Getting Started

Instructions

About

Uh oh!

Releases

Packages

Languages

SamJCKnox/P2_Reacher_Submission

Folders and files

Latest commit

History

Repository files navigation

Project 2: Reacher - Continuous control

The Environment

State

Actions

Rewards

Solution

Getting Started

Instructions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages