Git Product home page Git Product logo

ryannnxu's Projects

a3c-pytorch icon a3c-pytorch

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

a3c_tensorflow icon a3c_tensorflow

Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'

ai-blog icon ai-blog

Accompanying repository for Let's make a DQN / A3C series.

async-rl icon async-rl

Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"

async-rl-1 icon async-rl-1

Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Theano + OpenAI Gym)[1-step Q-learning, n-step Q-learning, A3C]

asynchronous-methods-for-deep-reinforcement-learning icon asynchronous-methods-for-deep-reinforcement-learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

binaryconnect icon binaryconnect

Training Deep Neural Networks with binary weights during propagations

binarynet icon binarynet

Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

colmap icon colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

ddpg-aigym icon ddpg-aigym

Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym environments

deep_q_rl icon deep_q_rl

Theano-based implementation of Deep Q-learning

deepfool icon deepfool

A simple and accurate method to fool deep neural networks

dqn icon dqn

This is a very basic DQN implementation, which uses OpenAI's gym environment and Keras/Theano neural networks.

dqn-tensorflow icon dqn-tensorflow

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

foolgo icon foolgo

A Go A.I. based on MCTS(AlphaGo's basic algorithm) WITHOUT Deep Learning

mcts icon mcts

Monte-Carlo Tree Search (MCTS) basic implementation.

michi icon michi

Minimalistic Go MCTS Engine

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.