liao-wk Goto Github PK

followers: 2.0 following: 7.0 repos: 47.0 gists: 0.0

Type: User

liao-wk's Projects

asynchronous-methods-for-deep-reinforcement-learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

breakingthrough-techonologies-data-analysis

心情第一弹，随便写写，哈哈哈哈

chess-alpha-zero

Chess reinforcement learning by AlphaGo Zero methods.

csgo_bot

CSGO bot based on csgo api

deep-reinforcement-learning-algorithms

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

deep-reinforcement-learning-applied-to-doom

DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM

deep_reinforcement_learning_on_car_racing_game

Deep Learning Project

differential-evolution

差分进化算法，支持小生境（niche）技术，支持多目标

ga-

遗传算法及其改进

ga-for-cvrp

Capacitated Vehicle Routing Problem

github-api-learning

《python编程从入门到实践》第17章使用API

gym-pybullet-drones

PyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control

javaml

Java Machine Learning Library

js-sorting-algorithm

一本关于排序算法的 GitBook 在线书籍《十大经典排序算法》，多语言实现。

jsprit

jsprit is a java based, open source toolkit for solving rich vehicle routing problems

lihang

Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]

moea-d

An implementation of MOEA/D by Zhang and Li 2007.

networkx-

通过官方文档学习的，欢迎拍砖，共同学习。

neural-networks

手里一直屯着一本《Python神经网络编程》的书，到现在也没看多少，所以给自己定了一个小目标：再一周内2.17-2.25看完这本书.

poi

Mirror of Apache POI

py2webwx

python写的web微信“智能“群发程序

pygame-learning-environment

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

rake-tutorial

A python implementation of the Rapid Automatic Keyword Extraction

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

recommendation-algorithm

推荐算法，搞起来。

reinforcementlearning-atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

scikit-opt

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman) 遗传算法、粒子群算法、模拟退火算法、蚁群算法等

liao-wk Goto Github PK

liao-wk's Projects

Recommend Projects

Recommend Topics

Recommend Org