Git Product home page Git Product logo

jrlzoo's Introduction

Jrlzoo

An RL codebase in JAX which focuses on efficiency and simplicity. Each baseline algorithm is self-contained in the corresponding folder.

On-policy RL

  • PPO

Off-policy RL

  • DDPG
  • DQN
  • SAC
  • TD3

Offline RL

  • COMBO
  • CQL
  • IQL
  • TD3BC

Fine-tuning Offline RL

  • AWAC
  • Off2OnRL
  • PEX

Meta RL

  • HypeRL
  • MAML
  • MQL
  • PEARL

Exploration

  • RND

Acknowledgements

jrlzoo's People

Contributors

fuyw avatar

Stargazers

 avatar Sacha Chernyavskiy avatar Maxim Bobrin avatar CTP_314 avatar Frank Röder avatar  avatar Neil J. Yan avatar  avatar

Watchers

Kostas Georgiou avatar  avatar

jrlzoo's Issues

About the performance of TD3 on halfcheetah-v2

Hi,

Thanks for your great repo.

I try to run the code of TD3 on halfcheetah-v2 and I find the return is not very stable and below the result that you report.

More specially, I try to train the model with 1e6 steps (not 3e6 in the config file) and the return is about 9500.

So I wonder whether I need to tune the hyper-parameters.

Thank you very much. & Thanks for your contribution again.

About the performanfe of td3bc on antmaze

Hi

I try to run the code of td3bc on antmaze-medium-play-v0 and I find the performance (is 0) is lower than the results in iql paper (10.6). May I ask the experimental setting about that?

Thank you very much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.