Comments (5)
Can you add an environment wrapper (e.g. gym.Wrapper) to change the dict obs to numpy?
from tianshou.
Oh, it's the first time I know there is a wrapper function to do such kind of things. I'll try it. Thank you!
from tianshou.
Btw, the collector supports storing a dictionary. On my side, the network will receive a np.array which includes a list of dict. You can extract your desired data like x = np.array([d['observation'] for d in s])
in your network forwarding. It works fine on my side.
from tianshou.
顺便说一句,收集器支持存储字典。在我这边,网络将收到一个包含字典列表的np.array。您可以像
x = np.array([d['observation'] for d in s])
在网络转发中一样提取所需的数据。它在我这边工作正常。
Hi, could you please tell me how to save the dic in collect?
I read the guides online, but don't know how to realize.
Because the states and rewards come from collector, I cannot save the desired key into buffer.
from tianshou.
@ChenyangRan I add this feature yesterday. Please refer to #38
from tianshou.
Related Issues (20)
- Revisit `Launcher` for starting multiple experiments HOT 1
- Adjust locations of setting the policy in train/eval mode HOT 1
- Change log is chaotic and partly uninformative HOT 2
- how to run RL using multi-nodes in cluster HOT 1
- Potential confusion about where start timesteps are collected in HL interfaces HOT 4
- Use Altair inside a notebook to display benchmark results
- Does Tianshou truly supports MARL out of the box? HOT 1
- Extend benchmark with mujoco v4 envs
- How can I make action sampling within the range specified by my environment when using onpolicy_trainer? HOT 6
- Document effects of the relations between buffer size, num workers and episode length
- Poetry update the torch versioned from cuda (2.0.1+cu118) to cpu (2.1.1) defaultly on Windows HOT 9
- [question] Why does Tianshou use a replay buffer in on-policy RL algorithms? HOT 1
- ImportError: cannot import name 'Self' from 'typing' (/root/miniconda3/lib/python3.10/typing.py) HOT 1
- ModuleNotFoundError: No module named 'tianshou.highlevel' HOT 2
- Support dict observation spaces in highlevel api
- get_env_attr not working in SubprocVectorEnv? HOT 2
- How to save the log which axis is each epoch not epoch's steps? HOT 2
- Python Bug: lambda function refers only one environment HOT 4
- expected to be in range of [-1, 0], but got 1 HOT 3
- Unable to replicate original PPO performance HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tianshou.