Git Product home page Git Product logo

rl-ammi's Introduction

RL-AMMI

AMMI, Deep RL, Fall 2021: RL Implementation for Continuous Control Tasks

Course and Project details

This Deep RL course was taught at The African Master's in Machine Intelligence AMMI in Fall 2021. It was instructed by researchers at DeepMind: Bilal Piot, Corentin Tallec and Florian Strub. This project is the coursework of Deep RL where we Catalyst Agents team trying to re-implement RL algorithm(s) for continuous control tasks. We chose three types of environments: easy, medium, and hard to run the algorithm(s). The course project meant to submit only one algorithm, but we plan to continue working on this repo making it an open project by this team of student from AMMI. This is why we're trying to make a modular repo to ease the re-implementation of future algorithms.

Algorithm:

Algorithm we re-implementing/plannning to re-implement:

  1. Soft Actor-Critic (SAC) Paper (Now)

  2. Model-Based Policy Optimization (MBPO) Paper (Next; Future work)

  3. Model Predictive Control-Soft Actor Critic (MPC-SAC) Paper (Next; Future work)

  4. Model Predictive Actor-Critic (MoPAC) Paper (Next; Future work)

How to use this code

Installation

Ubuntu 20.04

Move into rl-ammi directory, and then run the following:

conda create -n rl-ammi python=3.8

pip install -e .

pip install numpy

pip install torch

pip install wandb

pip install gym

If you want to run MuJoCo Locomotion tasks, and ShadowHand, you should install MuJoCo first (it's open sourced until 31th Oct), and then install mujoco-py:

sudo apt-get install ffmpeg

pip install -U 'mujoco-py<2.1,>=2.0'

If you are using A local GPU of Nvidia and want to record MuJoCo environments issue link, run:

unset LD_PRELOAD

MacOS

Move into rl-ammi directory, and then run the following:

conda create -n rl-ammi python=3.8

pip install -e .

pip install numpy

pip install torch

pip install wandb

pip install gym

If you want to run MuJoCo Locomotion tasks, and ShadowHand, you should install MuJoCo first (it's open sourced until 31th Oct), and then install mujoco-py:

brew install ffmpeg

pip install -U 'mujoco-py<2.1,>=2.0'

If you are using A local GPU of Nvidia and want to record MuJoCo environments issue link, run:

unset LD_PRELOAD

Run an experiment

Move into rl-ammi/ directory, and then:

python experiment.py -cfg <cfg_file-.py> -seed <int>

for example:

python experiment.py -cfg sac_hopper -seed 1

Evaluate an Agent

To evaluate a saved policy model, run the following command:

python evaluate_agent.py -env <env_name> -alg <alg_name> -seed <int> -EE <int>

for example:

python evaluate_agent.py -env Walker2d-v2 -alg SAC -seed 1 -EE 5

Experiments and Results

Classic Control

MuJoCo Locomotion

ShadowHand

Catalyst Agents Team, Group 2

(first name alphabetical order)

Accknowledgement

This repo was inspired by many great repos, mostly the following ones:

rl-ammi's People

Contributors

wafaa014 avatar mohammedelfatihsalah avatar ruba128 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.