Git Product home page Git Product logo

trpo--jjkke88's Introduction

recently the algorithm has been moved to https://github.com/jjkke88/RL_toolbox

trpo

trust region policy optimitztion base on gym and tensorflow

There are three versions of trpo, one for decrete action space like mountaincar, one for decreate action space task with image as input like atari games, and the last for continuous action space for pendulems.

The environment is base on openAI gym.

part of code refer to rllab

dependency

  • tensorflow 0.10
  • prettytensor
  • latest openai gym

constructure for code

  • baseline:baseline estimation of baseline function
  • checkpoint:folder to store model file, can not be delete or will cause some error
  • distribution:distribution base class, it can be used to calculate probability of distributions, for example Gaussian.
  • logger:have a Logger class for log data to .csv file
  • agent:for disperse action space and continous action space
  • log:store log file
  • experiment: contain many different main file, run main file can start trainning or testing
  • environment.py: environment
  • krylov.py: implement of some math method:conjugate gradient descent , calculating hessian matrix
  • parameters.py: config file
  • utils.py: implement of some basic function: getFlat, setFlat, lineaSearch

recent work

  • imple multi-thread trpo run python main_multi_thread.py to try
  • imple tensorflow distributed trpo
  • imple trpo multi-process

future work

  • complete trpo with image as input

trpo--jjkke88's People

Contributors

aqrobotics avatar wojzaremba avatar jjkke88 avatar openai-sys-okta-integration avatar zhongwen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.