Comments (3)
Hi dynamik,
Basically, the idea is that with purely random exploration, every ordered sequence of actions is equally likely. E.g., with two possible actions {1,2} and two time steps, the sequences {11,12,21,22} each have the same probability 0.25 of being tried out.
For the LongerExplorationPolicy, it is the unordered sequences that have uniform probabilities. So in the example, {11} has probability 0.33, {22} has 0.33, and {12, 21} together have 0.33 (0.17 each). That can be useful in environments such as grid worlds, where the order of the actions does not matter in most situations.
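To make the arithmetic above concrete, here is a small sketch that enumerates both distributions for a given action set and sequence length. It is only an illustration of the probabilities, not the code used inside LongerExplorationPolicy; the function names `ordered_uniform` and `unordered_uniform` are made up for this example.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product


def ordered_uniform(actions, length):
    """Pure random exploration: every ordered sequence is equally likely."""
    seqs = list(product(actions, repeat=length))
    p = Fraction(1, len(seqs))
    return {s: p for s in seqs}


def unordered_uniform(actions, length):
    """Uniform over unordered sequences (multisets of actions).

    Each multiset gets the same total probability, which is then
    split evenly among the orderings that realise it.
    """
    groups = defaultdict(list)
    for s in product(actions, repeat=length):
        groups[tuple(sorted(s))].append(s)
    p_group = Fraction(1, len(groups))
    return {
        s: p_group / len(members)
        for members in groups.values()
        for s in members
    }


if __name__ == "__main__":
    for name, dist in [
        ("ordered", ordered_uniform([1, 2], 2)),
        ("unordered", unordered_uniform([1, 2], 2)),
    ]:
        for seq, p in sorted(dist.items()):
            print(name, seq, p)
```

With actions {1,2} and two time steps, the first function gives 1/4 to each of the four sequences, and the second gives 1/3 to 11, 1/3 to 22, and 1/6 each to 12 and 21, matching the numbers above.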
The length parameter should be chosen depending on your environment. Usually you'll have to try a few possibilities empirically and see what works.
Best,
Vincent
from deer.
Hi VinF,
Thanks for your quick response!
How do you think the Ornstein-Uhlenbeck process compares?
Best,
Roman
Hi Roman,
Indeed, you could possibly find parallels with the nomenclature in the domain of stochastic processes depending on the setting considered.