Comments (6)
These are all very good points and it does seem that you can improve the performance of the agent. However, the original intent of the A3C algorithm (as well as DQN and all its predecessors) is to create one algorithm that can play a variety of games. In this case, the StateProcessor is designed for all Atari games.
Your custom feature engineering works fine for Breakout, but does it work for all Atari games without manual intervention?
from reinforcement-learning.
Thank you
Exactly, I want to use specific game knowledge to enhance the training process. If it works, then I will think how to "guess" the game knowledge implicitly.
The problem now is: Performance of DQN (and A3C) is not compatible with the papers! Kindly see
#30
from reinforcement-learning.
from reinforcement-learning.
Hi, feed_dict in Tensorflow seems very slow. Using sess.run to process each state seems not a good idea.
from reinforcement-learning.
How else do you... use Tensorflow?
from reinforcement-learning.
Maybe it does not need to use Tensorflow to process each state. Using numpy or other libraries can avoid use feed_dict frequently.
from reinforcement-learning.
Related Issues (20)
- Why CliffWalkingEnv returns 'is_done=True' when reaching cliff? HOT 2
- Is a line missing in 'MC Control with Epsilon-Greedy Policies Solution.ipynb'? HOT 1
- Why is Chapter 11 excluded? HOT 2
- why DQN use kernel size 8 ?
- Gambler's Problem: 0 Stake Allowed?
- Some question in MC Control with Epsilon-Greedy Policies Solution.ipynb HOT 2
- DQL size error
- Policy Evaluation Exercise Solution Is Wrong HOT 1
- Monte Carlo AssertionError: defaultdict(<function mc_control_importance_sampling.<locals>.<lambda> at 0x7f31699ffe18>, {}) (<class 'collections.defaultdict'>)
- Lecture Slides need an update
- Clarification on DQN testing rewards on Atari games
- DQN Testing Rewards on Atari Games HOT 1
- Reinforcement learning policy HOT 1
- Minor Link fix
- A small correction in "MDPs and Bellman Equations" section
- Typo in: "Model-Free Prediction & Control with Monte Carlo (MC)" section -> "Blackjack Playground.ipynb" file:
- Issue in: reinforcement-learning/MC/MC Prediction Solution.ipynb
- please provide requirements.txt or mention the exact version of packages used.
- demystifying-deep-reinforcement-learning link is broken
- MC Control with Epsilon-Greedy Policies ---Epsilon Value and Best Action prob error HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from reinforcement-learning.