This project is my solution for the Tennis environment.
In the Unity Tennis environment, two agents control rackets to bounce a ball back and forth over a net. An agent receives a reward of +0.1 each time it hits the ball over the net, and a reward of -0.01 if it lets the ball hit the ground or hits it out of bounds. The goal of each agent is therefore to keep the ball in play for as long as possible.
The observation space consists of 8 variables corresponding to the position and velocity of the ball and racket; each agent receives its own local observation. Two continuous actions are available, corresponding to movement toward (or away from) the net, and jumping.
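For orientation, the notebook opens the environment and inspects these spaces with the unityagents package. A minimal sketch is shown below; the file name `Tennis.app` is an assumption for macOS and should point at the environment you download in the setup steps:

```python
from unityagents import UnityEnvironment

# The file name is an assumption (macOS); point this at the environment
# you downloaded for your operating system.
env = UnityEnvironment(file_name="Tennis.app")

# Each Unity environment exposes one or more "brains" that control agents.
brain_name = env.brain_names[0]
brain = env.brains[brain_name]

# Reset the environment and look at the state and action spaces.
env_info = env.reset(train_mode=True)[brain_name]
num_agents = len(env_info.agents)                 # 2 agents
action_size = brain.vector_action_space_size      # 2 continuous actions per agent
states = env_info.vector_observations             # one local observation per agent

print("Number of agents:", num_agents)
print("Action size:", action_size)
print("Observation size per agent:", states.shape[1])
```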
I developed everything on macOS, but these instructions should work for Windows and Linux as well.
1. Create a virtual environment with Python 3.6 and activate it. All requirements will be installed in this environment.
2. Download the Unity ML-Agents Toolkit repository and follow its installation instructions.
3. The project additionally requires PyTorch; a simple `pip install torch` will suffice.
4. Create an IPython kernel for the environment created in step 1:

   `python -m ipykernel install --user --name env_name --display-name "kernel_name"`
5. Download the Unity environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
6. Open Jupyter Notebook, open the Tennis.ipynb file, and select the appropriate kernel (the kernel_name set in step 4).
7. Follow the instructions in Tennis.ipynb to train the agent or to watch the pretrained agent interact with the environment.
There are 3 parts to the code:

The first part covers the setup cells, described in the notebook's markdown cells (a minimal sketch of the random-action loop follows this list):
- Start the Environment
- Examine the State and Action Spaces
- Take Random Actions in the Environment
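A sketch of the random-action loop, assuming `env`, `brain_name`, `num_agents`, and `action_size` were set up as in the earlier sketch:

```python
import numpy as np

# Play one episode with purely random actions.
env_info = env.reset(train_mode=False)[brain_name]   # inference mode (real-time)
scores = np.zeros(num_agents)                        # per-agent episode scores

while True:
    actions = np.random.randn(num_agents, action_size)  # sample random actions
    actions = np.clip(actions, -1, 1)                    # keep them in the valid range
    env_info = env.step(actions)[brain_name]             # send actions to the environment
    scores += env_info.rewards                           # accumulate per-agent rewards
    if np.any(env_info.local_done):                      # stop when either agent is done
        break

print("Episode score (max over agents):", np.max(scores))
```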
The second part corresponds to the markdown Solve: Training the Agent with DDPG. Training stops when the agent reaches a target average score of +0.5 over 100 consecutive episodes.
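The stopping criterion in the training loop looks roughly like the sketch below. The `agent` object and its `act`/`step` interface are assumptions in the usual DDPG style, and the episode score is taken here as the maximum over the two agents:

```python
from collections import deque
import numpy as np

scores_window = deque(maxlen=100)        # rolling window of the last 100 episode scores

for i_episode in range(1, 10001):
    env_info = env.reset(train_mode=True)[brain_name]
    states = env_info.vector_observations
    scores = np.zeros(num_agents)
    while True:
        actions = agent.act(states)                                # hypothetical agent API
        env_info = env.step(actions)[brain_name]
        next_states = env_info.vector_observations
        rewards = env_info.rewards
        dones = env_info.local_done
        agent.step(states, actions, rewards, next_states, dones)   # store experience and learn
        states = next_states
        scores += rewards
        if np.any(dones):
            break
    scores_window.append(np.max(scores))   # episode score: best of the two agents
    if len(scores_window) == 100 and np.mean(scores_window) >= 0.5:
        print(f"Environment solved in {i_episode} episodes.")
        break
```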
The third part corresponds to the markdown Watch a Smart Agent!. It loads the trained model weights from the Saved Weights folder.
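Loading and running the pretrained agent amounts to something like the sketch below; the checkpoint filenames are assumptions, so use the actual files found in the Saved Weights folder, and `agent` is the same hypothetical DDPG agent as above:

```python
import numpy as np
import torch

# Filenames are assumptions; check the Saved Weights folder for the real ones.
agent.actor_local.load_state_dict(
    torch.load("Saved Weights/checkpoint_actor.pth", map_location="cpu"))
agent.critic_local.load_state_dict(
    torch.load("Saved Weights/checkpoint_critic.pth", map_location="cpu"))

# Run one episode with the trained actor and no exploration noise.
env_info = env.reset(train_mode=False)[brain_name]
states = env_info.vector_observations
scores = np.zeros(num_agents)
while True:
    actions = agent.act(states, add_noise=False)   # hypothetical agent API
    env_info = env.step(actions)[brain_name]
    states = env_info.vector_observations
    scores += env_info.rewards
    if np.any(env_info.local_done):
        break

print("Score (max over agents):", np.max(scores))
```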
To view the trained agent without going through the hassle of training, simply run all the cells in the first and third parts.
