Topic: policy-gradient Goto Github
Some thing interesting about policy-gradient
Some thing interesting about policy-gradient
policy-gradient,Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
User: abhisheksuran
policy-gradient,Highly Modular and Scalable Reinforcement Learning
User: activatedgeek
Home Page: https://torchrl.sanyamkapoor.com
policy-gradient,Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Organization: agentmaker
policy-gradient,📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
User: allenpandas
policy-gradient,Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
User: allenpandas
policy-gradient,A curated list of Monte Carlo tree search papers with implementations.
User: benedekrozemberczki
policy-gradient,Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
User: bentrevett
policy-gradient,PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
User: cherrypiesexy
policy-gradient,强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Organization: datawhalechina
policy-gradient,HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Organization: dena
policy-gradient,Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
User: erfanmhi
Home Page: http://rll.berkeley.edu/deeprlcourse/
policy-gradient,Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
User: germain-hug
policy-gradient,A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
User: gordicaleksa
Home Page: https://youtube.com/c/TheAIEpiphany
policy-gradient,This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
User: hsvgbkhgbv
policy-gradient,Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
User: jasonyao81000
policy-gradient,강화학습에 대한 기본적인 알고리즘 구현
User: jcwleo
policy-gradient,An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
User: kengz
policy-gradient,Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
User: kengz
Home Page: https://slm-lab.gitbook.io/slm-lab/
policy-gradient,Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
User: keon
policy-gradient,PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
User: khrylx
policy-gradient,Scalable, event-driven, deep-learning-friendly backtesting library
User: kismuz
Home Page: https://kismuz.github.io/btgym/
policy-gradient,A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
User: liamconnell
policy-gradient,Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
User: liziniu
policy-gradient,Clean baseline implementation of PPO using an episodic TransformerXL memory
User: marcometer
policy-gradient,Baseline implementation of recurrent PPO using truncated BPTT
User: marcometer
policy-gradient,Structural implementation of RL key algorithms
User: medipixel
Home Page: https://www.medipixel.io/
policy-gradient,A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
User: mg2033
policy-gradient,Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
User: morvanzhou
Home Page: https://mofanpy.com/tutorials/machine-learning/reinforcement-learning/
policy-gradient,This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
User: navneet-nmk
policy-gradient,Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
User: nikhilbarhate99
policy-gradient,Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
User: omerbsezer
policy-gradient,Trust Region Policy Optimization with TensorFlow and OpenAI Gym
User: pat-coady
Home Page: https://learningai.io/projects/2017/07/28/ai-gym-workout.html
policy-gradient,Reinforcement learning tutorials
User: pythonlessons
Home Page: https://pylessons.com/
policy-gradient,"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
User: rintarooo
policy-gradient,DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
User: ritchiehuang
policy-gradient,Minimal and Clean Reinforcement Learning Examples
Organization: rlcode
policy-gradient,[파이썬과 케라스로 배우는 강화학습] 예제
Organization: rlcode
policy-gradient,Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Organization: salesforce
Home Page: https://arxiv.org/abs/1808.10568
policy-gradient,Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
User: sudharsan13296
Home Page: https://www.amazon.com/dp/1839210680/ref=cm_sw_r_tw_dp_x_0HRDFbW4MN11H
policy-gradient,Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
User: sudharsan13296
policy-gradient,A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
User: suragnair
policy-gradient,PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
User: sweetice
policy-gradient,Multiple implementations for abstractive text summurization , using google colab
User: theamrzaki
Home Page: https://medium.com/@theamrzaki
policy-gradient,Personal experiments on Reinforcement Learning
User: theolvs
policy-gradient,An elegant PyTorch deep reinforcement learning library.
Organization: thu-ml
Home Page: https://tianshou.org
policy-gradient,Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
User: tsenghungchen
Home Page: https://tsenghungchen.github.io/show_adapt_tell/
policy-gradient,DEEp Reinforcement learning framework
User: vinf
policy-gradient,Deep Reinforcement Learning For Sequence to Sequence Models
User: yaserkl
Home Page: https://arxiv.org/abs/1805.09461
policy-gradient,Implementations of Reinforcement Learning Models in Tensorflow
User: yukezhu
policy-gradient,lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
User: zuoxingdong
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.