Comments (4)
ADDENDUM: I can manually increase --n-eval-steps to >100 steps, but then the evaluation resets after 100 timesteps.
from rl_reach.
The short answer is no. You should train and evaluate using the same environment. If you notice some oscillations at evaluation time, one solution is to increase the action space as you said by changing action_min
and action_max
in init.py. However you should then re-train.
Regarding your 2nd query, this is normal as the number of evaluation steps is independent of the max_episode_steps=100
in init.py. However it makes sense to set n-eval-steps
as a multiple of max_episode_steps
to evaluate with a finite number of episode. For example, if n-eval-steps = 2000
and max_episode_steps=100
, you will use 20 full episodes to evaluate your trained policy. However it makes sense to evaluate over many episodes mainly if you use non-deterministic evaluation.
from rl_reach.
Thanks for the thorough explanation. As a short follow-up: Where do the action_min and action_max from the init.py overwrite the hard-coded action_min and action max in the environment?
from rl_reach.
action_min and action_max are used to define the boundaries of the action space here
from rl_reach.
Related Issues (8)
- -alg:her gives error AttributeError: 'Box' object has no attribute 'spaces' HOT 4
- evaluate_policy.py - Where to put customEnv for gym registration? HOT 3
- evaluate_policy.py - Cant find 'res_episode_1.csv' HOT 3
- evaluate_policy.py - evaluation of multiple seed cant find 'stats.csv' HOT 3
- forcing joint position and setting joint position HOT 11
- ddpg + her HOT 4
- UnregisteredEnv: No registered env with id: widowx_reacher-v47 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rl_reach.