This project illustrates how to train an agent to navigate (and collect bananas!) in a large, square world.
A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana. Thus, the goal of our agent is to collect as many yellow bananas (healthy) as possible while avoiding blue bananas (rotten).
The state space has 37 dimensions and contains the agent's velocity, along with a ray-based perception of objects around the agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:
- 0: move forward.
- 1: move backward.
- 2: turn left.
- 3: turn right.
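As a rough illustration of the interface described above, the sketch below maps the four action indices to labels and picks one uniformly at random for a 37-dimensional state. The names `ACTIONS`, `STATE_SIZE` and `random_policy` are hypothetical and not part of this repository; a trained agent would replace the random choice with a learned policy.

```python
import random

# Action indices as listed above; the labels are only for readability.
ACTIONS = {0: "move forward", 1: "move backward", 2: "turn left", 3: "turn right"}
STATE_SIZE = 37  # velocity plus ray-based perception, per the description above


def random_policy(state, rng=random):
    """Return a uniformly random action index for a 37-dimensional state.

    This is an untrained baseline (an assumption for illustration),
    not the agent trained in the notebook.
    """
    if len(state) != STATE_SIZE:
        raise ValueError(f"expected {STATE_SIZE} state values, got {len(state)}")
    return rng.randrange(len(ACTIONS))


if __name__ == "__main__":
    action = random_policy([0.0] * STATE_SIZE)
    print(action, ACTIONS[action])
```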
The task is episodic, and to solve the environment, our agent must get an average score of +13 over 100 consecutive episodes.
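The solving criterion can be checked with a sliding window over episode scores. The following is a minimal sketch, assuming scores are collected as a plain list of per-episode totals; the function name `episodes_to_solve` is hypothetical.

```python
from collections import deque


def episodes_to_solve(scores, target=13.0, window=100):
    """Return the 1-based episode at which the average of the last
    `window` scores first reaches `target`, or None if it never does."""
    recent = deque(maxlen=window)  # keeps only the most recent `window` scores
    for episode, score in enumerate(scores, start=1):
        recent.append(score)
        if len(recent) == window and sum(recent) / window >= target:
            return episode
    return None
```

For example, 50 episodes scoring 0 followed by 100 episodes scoring 13 would count as solved at episode 150, the first point where a full window of 100 episodes averages +13.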
Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
(For Windows users) Check out this link if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.
Note: unfortunately, I could only test the project on OSX 10.11.6.
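If you are unsure which download applies to your machine, a small Python check can suggest one. This is only a sketch: `pick_download` is a hypothetical helper, and `sys.maxsize > 2**32` reports whether the Python interpreter is 64-bit, which may differ from the operating system itself (for instance, 32-bit Python on 64-bit Windows).

```python
import platform
import sys


def pick_download(system, is_64bit):
    """Map a platform name to one of the download labels listed above."""
    if system == "Linux":
        return "Linux"
    if system == "Darwin":  # platform.system() reports macOS as "Darwin"
        return "Mac OSX"
    if system == "Windows":
        return "Windows (64-bit)" if is_64bit else "Windows (32-bit)"
    return None  # unknown platform: no matching download


if __name__ == "__main__":
    # Interpreter bitness is used as a stand-in for OS bitness (an assumption).
    print(pick_download(platform.system(), sys.maxsize > 2**32))
```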
Place the file in the root folder of this repository, and unzip (or decompress) the file.
Install all the required dependencies:
The main requirements of this project are Python==3.6, numpy, matplotlib, jupyter, pytorch and unity-agents. To ease installation, I recommend the following procedure:
Install Anaconda or Miniconda.
Feel free to skip this step if you already have either one installed on your machine.
For OSX users, I would recommend trying the step outlined here.
Create the environment:
conda create -n drlnd-navigation python=3.6
Activate the environment:
conda activate drlnd-navigation
Install the dependencies:
pip install -r requirements.txt
Alternatively, you can create the environment from the YAML file provided with the repo:
conda env create -f environment_osx.yml
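After installing, you may want to confirm that the main requirements can be found in the active environment. The sketch below is one way to do that; the helper `missing_modules` is hypothetical, and the import names `torch` (for pytorch) and `unityagents` (for unity-agents) are assumptions about how those packages are exposed.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found by the importer."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed import names for the requirements listed above.
REQUIRED = ["numpy", "matplotlib", "torch", "unityagents"]

if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```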
Launch a jupyter notebook and follow the tutorial in Navigation.ipynb to train your own agent!
If you close the shell running the Jupyter server, remember to activate the environment again before relaunching it:
conda activate drlnd-navigation
Please gimme a ⭐️ in the GitHub banner. I am also open for discussions, especially when accompanied by ☕ or 🍺.