Comments (7)
You can use gym.wrapper.
Btw, the limitation of Pendulum-v0 defaults to be 200 frames. So what's your gym version? (0.17.1 for me and works fine, at 200th timestep it returns done=True
)
from tianshou.
The reference code:
In [14]: import gym
...: print(gym.__version__)
...: e = gym.make('Pendulum-v0')
...: e.reset()
...: i, d = 0, False
...: while not d:
...: _, _, d, _ = e.step([0.1])
...: i += 1
...: print(i)
0.17.1
200
from tianshou.
I see "done=False“ in sourcecode pendulum.py, you mean "gym.Wrapper" does extra work ( 200th timestep) on Pendulum?
from tianshou.
No, I mean the default gym.make('Pendulum-v0')
will not cause this problem. Manually adding gym.wrapper is the last choice.
from tianshou.
I think setting "done=True" after 200 steps is added by "gym.Wrapper".
In your code above, I add "e.unwrapped" after "e = gym.make('Pendulum-v0')", it will not break the while loop.
from tianshou.
In my one Env, using tianshou.ppo to train the agent always warning "'There are already many steps in an episode. You should add a time limitation to your environment!", but Pendulum doesn't warning. At first I think you did something magic in tianshou's code so I open a issue. Sorry for taking your time, Thanks!
from tianshou.
Yes, unwrapped
action will remove the time limit added by the gym.wrapper.
from tianshou.
Related Issues (20)
- how to run RL using multi-nodes in cluster HOT 1
- Potential confusion about where start timesteps are collected in HL interfaces HOT 4
- Use Altair inside a notebook to display benchmark results
- Does Tianshou truly supports MARL out of the box? HOT 1
- Extend benchmark with mujoco v4 envs
- How can I make action sampling within the range specified by my environment when using onpolicy_trainer? HOT 6
- Document effects of the relations between buffer size, num workers and episode length
- Poetry update the torch versioned from cuda (2.0.1+cu118) to cpu (2.1.1) defaultly on Windows HOT 9
- [question] Why does Tianshou use a replay buffer in on-policy RL algorithms? HOT 1
- ImportError: cannot import name 'Self' from 'typing' (/root/miniconda3/lib/python3.10/typing.py) HOT 1
- ModuleNotFoundError: No module named 'tianshou.highlevel' HOT 2
- Support dict observation spaces in highlevel api
- get_env_attr not working in SubprocVectorEnv? HOT 2
- How to save the log which axis is each epoch not epoch's steps? HOT 2
- Python Bug: lambda function refers only one environment HOT 4
- expected to be in range of [-1, 0], but got 1 HOT 3
- Unable to replicate original PPO performance HOT 7
- Clarification Needed on Implementing Action Masking in DQN with preprocess_fn in Collector
- will add dreamerv3 ?
- Documentation for multi-agent needs fixing
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tianshou.