Git Product home page Git Product logo

iwies's Introduction

Instance Weighted Incremental Evolution Strategies (IW-IES)

This repo contains code accompanying the paper: Zhi Wang, Chunlin Chen, and Daoyi Dong, "Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments", IEEE Transactions on Neural Networks and Learning Systems, 2022. It contains code for running the incremental learning tasks, including 2D navigation, Swimmer, Hopper, and HalfCheetah domains. The basic reinforcement learning algorithms are implemented using natural evolution strategies.

Dependencies

This code requires the following:

  • python 3.5+
  • pytorch 0.4+
  • gym
  • MuJoCo license

Data

  • For the 2D navigation domain, data is generated from envs/navigation.py
  • For the Swimmer/Hopper/HalfCheetah Mujoco domains, the modified Mujoco environments are in envs/mujoco/*

Usage

  • For example, in Case I of the navigation domain, just run the bash script navi_v1_iwies.sh to get the results of iwies and its ablation methods, also see the usage instructions in the script and main.py; just run the bash script navi_v1_baselines.sh to get the results of the baselines including Robust, Hist, SO-CMA, and ES-MAML, also see the usage instructions in the script and baselines.py
  • When getting the results in output/*/*.npy files, plot the results using data_process.py. For example, the results for the navigation domains are as follows:
Case I Case II Complex Case
iwies results for Case I iwies results for Case II iwies results for Complex Case
baseline results for Case I baselines results for Case II baselines results for Complex Case

Note that these results are from a single run of the code. You can randomly change the environment to a new one, and record the performance of all tested methods when adapting to the new environment. In our paper, we repeat the process ten times and report the mean and standard error to demonstrate the performance for learning in stochastic dynamic environments. For example, the results for Case I of navigation domain and the swimmer domain are as follows:

navigation_v1 swimmer
experimental results for navigation_v1 domain experimental results for half cheetah domain

Also, the results for other demo scripts are shown in exp/*

Contact

To ask questions or report issues, please open an issue on the issues tracker, or email to [email protected].

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.