Git Product home page Git Product logo

liao-wk's Projects

airship icon airship

方向改了,从此开始做调度了。

asynchronous-methods-for-deep-reinforcement-learning icon asynchronous-methods-for-deep-reinforcement-learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

deep-reinforcement-learning-algorithms icon deep-reinforcement-learning-algorithms

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

ga- icon ga-

遗传算法及其改进

gym-pybullet-drones icon gym-pybullet-drones

PyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control

js-sorting-algorithm icon js-sorting-algorithm

一本关于排序算法的 GitBook 在线书籍 《十大经典排序算法》,多语言实现。

jsprit icon jsprit

jsprit is a java based, open source toolkit for solving rich vehicle routing problems

lihang icon lihang

Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]

moea-d icon moea-d

An implementation of MOEA/D by Zhang and Li 2007.

networkx- icon networkx-

通过官方文档学习的,欢迎拍砖,共同学习。

neural-networks icon neural-networks

手里一直屯着一本《Python神经网络编程》的书,到现在也没看多少,所以给自己定了一个小目标:再一周内2.17-2.25看完这本书.

poi icon poi

Mirror of Apache POI

py2webwx icon py2webwx

python写的web微信“智能“群发程序

rake-tutorial icon rake-tutorial

A python implementation of the Rapid Automatic Keyword Extraction

ray icon ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

reinforcementlearning-atarigame icon reinforcementlearning-atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

rlcard icon rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

scikit-opt icon scikit-opt

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman) 遗传算法、粒子群算法、模拟退火算法、蚁群算法等

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.