Git Product home page Git Product logo

Comments (4)

stefanwanckel avatar stefanwanckel commented on July 17, 2024

ADDENDUM: I can manually increase --n-eval-steps to >100 steps, but then the evaluation resets after 100 timesteps.

from rl_reach.

PierreExeter avatar PierreExeter commented on July 17, 2024

The short answer is no. You should train and evaluate using the same environment. If you notice some oscillations at evaluation time, one solution is to increase the action space as you said by changing action_min and action_max in init.py. However you should then re-train.

Regarding your 2nd query, this is normal as the number of evaluation steps is independent of the max_episode_steps=100 in init.py. However it makes sense to set n-eval-steps as a multiple of max_episode_steps to evaluate with a finite number of episode. For example, if n-eval-steps = 2000 and max_episode_steps=100, you will use 20 full episodes to evaluate your trained policy. However it makes sense to evaluate over many episodes mainly if you use non-deterministic evaluation.

from rl_reach.

stefanwanckel avatar stefanwanckel commented on July 17, 2024

Thanks for the thorough explanation. As a short follow-up: Where do the action_min and action_max from the init.py overwrite the hard-coded action_min and action max in the environment?

from rl_reach.

PierreExeter avatar PierreExeter commented on July 17, 2024

action_min and action_max are used to define the boundaries of the action space here

from rl_reach.

Related Issues (8)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.