Comments (2)
Hi, @tomelliot!
Noticed your comment, when was looking at log-uniform function of unreal algorithm.
Actually, I think that code, you suggest, isn't right implementation of log-uniform distribution sampling? and even stackoverflow
answer is not correct. parameters low
and high
are expected to be inside logarithm function, and uniform distribution should sample value between log(low)
and log(high)
.
I reference to the similar, but not the same question and the answer https://math.stackexchange.com/a/1411802
Second question is about training process. Was the problem because of wrong predefined hit reward
or not enough training time?
from unreal.
I couldn't find a decisive source defining "log uniform", just comments on SO and the like. I just went with the answer from SO, but you're correct that the code snippet isn't correct. It should be:
def loguniform(low=0, high=1, size=None): return np.exp(np.random.uniform(np.log(low), np.log(high), size))
Second question is about training process. Was the problem because of wrong predefined hit reward or not enough training time?
IIRC training time wasn't the problem - I ran it for sufficient epochs for the complexity of the problem.
from unreal.
Related Issues (20)
- always display "Map loaded: 'nav_maze_static_01" HOT 2
- Issues in _process_base and _process_pc HOT 1
- About reward prediction task HOT 2
- Value function replay and pixel control not uses LSTM context - why? HOT 5
- Replicating inputs for VR and PC HOT 1
- lstm only have one cell?
- bazel failed to build, not visible from target HOT 3
- Clarification on flag grad_norm_clip HOT 2
- Original RGB image from the lab simulator HOT 1
- Questions about Experience Replay Buffer HOT 1
- Base Process Questions
- Type error in options HOT 1
- Performance across multiple runs
- Feature Control HOT 1
- Builds complete successfully, but no results are shown.
- How to change the env_name
- Resource exhausted: OOM when allocating tensor with shape whe running‘train’
- A question about visualize.py HOT 1
- How to find the data of the training process of the trained model?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from unreal.