Expert-guided Data-efficient RL

Case 1: sub-optimal RL agent as expert knowledge

Case 2: low-dimensional dynamics as expert knowledge

Dataset

Offline demonstrations from the Walker2d-v2 environment of MuJoCo

mkdir demon && cd demon/
wget https://rail.eecs.berkeley.edu/datasets/offline_rl/gym_mujoco_v2/walker2d_expert-v2.hdf5
wget https://rail.eecs.berkeley.edu/datasets/offline_rl/gym_mujoco_v2/walker2d_medium-v2.hdf5
wget https://rail.eecs.berkeley.edu/datasets/offline_rl/gym_mujoco_v2/walker2d_random-v2.hdf5
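Once downloaded, the files can be read with h5py. The sketch below is not part of this repository. It assumes the usual D4RL layout, in which transitions are stored as flat top-level arrays named observations, actions, rewards, terminals and timeouts; the helper names load_flat_dataset and episode_bounds are hypothetical.

```python
def load_flat_dataset(path, keys=("observations", "actions", "rewards",
                                  "terminals", "timeouts")):
    """Read selected top-level arrays from a D4RL-style HDF5 file.

    The key names are an assumption about the file layout; keys that
    are absent from the file are silently skipped.
    """
    import h5py  # imported lazily so episode_bounds works without h5py

    with h5py.File(path, "r") as f:
        # f[k][()] copies the whole dataset into memory as a NumPy array.
        return {k: f[k][()] for k in keys if k in f}


def episode_bounds(terminals, timeouts):
    """Return (start, end) index pairs, end exclusive, one per episode.

    An episode is taken to end at any step where either flag is set.
    A trailing run of steps with no end flag counts as a final episode.
    """
    bounds, start, n = [], 0, 0
    for i, (term, tout) in enumerate(zip(terminals, timeouts)):
        n = i + 1
        if term or tout:
            bounds.append((start, n))
            start = n
    if start < n:
        bounds.append((start, n))
    return bounds
```

For example, calling load_flat_dataset("demon/walker2d_medium-v2.hdf5") from the repository root and passing the resulting "terminals" and "timeouts" arrays to episode_bounds should give one index pair per trajectory, provided the file follows the assumed layout.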

Offline demonstrations from the DeepMind Control Suite (domain name 'walker', task name 'walk')

  • open the dataset link provided by V-D4RL
  • download the folder main/walker_walk/medium/64px
  • place the 64px folder at the repository root, so that the overall file structure becomes:
ExpertRL
└───64px
│   └───20220109T013713-488d0464873a40c99446c53b1468e1c9-501.npz
│   └───20220109T013713-681bef276feb456c8096851ae536d2a2-501.npz
│   │   ...
│   └───20220109T013743-e3b1c9be2e534355a7427ffcceb9d042-501.npz
└───demon
│   └───walker2d_expert-v2.hdf5
│   └───walker2d_medium-v2.hdf5
│   └───walker2d_random-v2.hdf5
└───dmc2gym
└───LowDynamics
│   └───README.md
│   │   ...
└───SubOptimal
│   └───README.md
│   │   ...
└───README.md
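The layout above can be checked before training with a small script. This is a minimal sketch, not code from the repository: the function name check_layout is hypothetical, and the list of expected paths is copied from the tree shown above (entries hidden behind "..." are not checked).

```python
from pathlib import Path

# Paths taken from the file structure listed in this README.
EXPECTED = [
    "demon/walker2d_expert-v2.hdf5",
    "demon/walker2d_medium-v2.hdf5",
    "demon/walker2d_random-v2.hdf5",
    "dmc2gym",
    "LowDynamics/README.md",
    "SubOptimal/README.md",
    "README.md",
]


def check_layout(root="."):
    """Return (missing, n_episodes) for the repository at `root`.

    `missing` lists expected paths that do not exist; `n_episodes`
    counts the .npz episode files found directly under 64px/.
    """
    root = Path(root)
    missing = [rel for rel in EXPECTED if not (root / rel).exists()]

    pixel_dir = root / "64px"
    n_episodes = len(list(pixel_dir.glob("*.npz"))) if pixel_dir.is_dir() else 0
    if n_episodes == 0:
        missing.append("64px/*.npz")
    return missing, n_episodes
```

Running check_layout() from the ExpertRL directory should return an empty list of missing paths once both datasets are in place.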

Installation

docker container

docker pull pytorch/pytorch:1.13.1-cuda11.6-cudnn8-devel
docker run -it --shm-size=8G \
    --gpus=all --restart=always \
    pytorch/pytorch:1.13.1-cuda11.6-cudnn8-devel \
    /bin/bash

additional library

pip install h5py

simulated environment

MuJoCo: we followed the installation instructions from mujoco-py.

wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz
tar -zxvf mujoco210-linux-x86_64.tar.gz
mkdir -p ~/.mujoco && mv mujoco210 ~/.mujoco/

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/root/.mujoco/mujoco210/bin

pip install -U 'mujoco-py<2.2,>=2.1'
pip install mujoco
pip install 'gym[mujoco]'

pip install 'cython<3'
apt update && apt install -y libosmesa6-dev libgl1-mesa-glx libglfw3
ln -s /usr/lib/x86_64-linux-gnu/libGL.so.1 /usr/lib/x86_64-linux-gnu/libGL.so
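A short random rollout is a quick way to confirm that the MuJoCo installation works. The sketch below is an assumption-laden example rather than repository code: random_rollout and smoke_test_walker2d are hypothetical names, and because the README does not pin a gym version, the loop accepts both the older 4-tuple and the newer 5-tuple return value of step().

```python
def random_rollout(env, max_steps=100):
    """Run one episode with random actions; return (steps, total_reward)."""
    out = env.reset()
    # Newer gym versions return (obs, info) from reset(); older ones return obs.
    obs = out[0] if isinstance(out, tuple) else out

    total, steps = 0.0, 0
    for _ in range(max_steps):
        result = env.step(env.action_space.sample())
        if len(result) == 5:
            # Newer API: obs, reward, terminated, truncated, info
            obs, reward, terminated, truncated, _ = result
            done = terminated or truncated
        else:
            # Older API: obs, reward, done, info
            obs, reward, done, _ = result
        total += float(reward)
        steps += 1
        if done:
            break
    return steps, total


def smoke_test_walker2d(max_steps=100):
    """Create Walker2d-v2 and roll it out once (requires gym and mujoco-py)."""
    import gym  # imported lazily; only needed for the real environment

    env = gym.make("Walker2d-v2")
    try:
        return random_rollout(env, max_steps)
    finally:
        env.close()
```

If smoke_test_walker2d() returns without an import or rendering error, the Walker2d-v2 environment used for the offline demonstrations is available.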

DeepMind Control Suite: we use the image wrapper from dmc2gym with a few modifications that are necessary for the updated MuJoCo version, so it must be installed locally from the bundled dmc2gym/ directory.

cd dmc2gym/ && pip install -e .
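After the local install, a pixel-based walker-walk environment can be created roughly as follows. This sketch assumes the modified copy keeps the keyword arguments of upstream dmc2gym.make (domain_name, task_name, seed, visualize_reward, from_pixels, height, width), which has not been checked against this repository. The 64-pixel size is chosen to match the 64px dataset folder, and both function names are hypothetical.

```python
def pixel_walker_kwargs(size=64, seed=0):
    """Keyword arguments for a pixel-based walker-walk environment."""
    return dict(
        domain_name="walker",
        task_name="walk",
        seed=seed,
        # Upstream dmc2gym refuses visualize_reward=True with from_pixels=True.
        visualize_reward=False,
        from_pixels=True,
        height=size,
        width=size,
    )


def make_pixel_walker(size=64, seed=0):
    """Build the environment using the locally installed dmc2gym copy."""
    import dmc2gym  # imported lazily; requires `pip install -e .` in dmc2gym/

    return dmc2gym.make(**pixel_walker_kwargs(size, seed))
```

Keeping the arguments in a separate function makes it easy to print and compare them against whatever the training scripts in SubOptimal/ and LowDynamics/ actually pass.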

Contributors

gokyeongryeol
