Git Product home page Git Product logo

continuousqlearning's Introduction

ContinuousQLearning

A (maybe working?) implementation of the first part of this paper: https://arxiv.org/pdf/1603.00748.pdf, tested on the OpenAI Pendulum task. Not very well documented / organized at present. Ideally I'll be able to make it robust enough to work across many tasks with minimal tuning (which may require implementing other features described in the paper). I also plan to try integrating the algorithm into some recurrent attention models (e.g. https://github.com/jlindsey15/RAM and possibly a modified version of https://github.com/jlindsey15/DRAM).

The following were/are helpful as references -- at the moment I don't think my code doesn't offer any more significant functionality than these... but more to come!

https://gist.github.com/tambetm/78227e1a15c52fbbcaeef7715dd079f0#file-pendulum-v0-md https://github.com/carpedm20/NAF-tensorflow

continuousqlearning's People

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.