The rsvg from takuseno

📖 About me

💻 Research Scientist @ Sony Research (2020/10/1 - Present)
🎓 Ph.D @ Keio University (2023)
🔥 IPA MITOU super creator (2020)
⌨️ Vimmer (a whole time)
👀 Visit here for more information

🚀 GitHub Projects

As an owner

d3rlpy: An offline deep reinforcement learning library
d4rl-atari: datasets for data-driven deep reinforcement learning with Atari 2600 (wrapper for datasets released by Google)
MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

As a contributor

rsvg's People

Contributors

Stargazers

Watchers

rsvg's Issues

Error related to size dismatch when running `play.py`

Hi, thanks for sharing the code.
I am recently trying to implement SVG with your repository as a reference.
I could have run train.py to create the trained model (only created metafile for unknown reason), however when I run play.py, the error related to size mismatch has occurred.
At first, I have changed the observation dimension to the size of train.py has used. But the problem has not resolved in extracting corresponded action value.
I am using TF version 1.4, but I don't think the reason is the difference of version. Does the play.py work your machine?

Assumption that all episodes are of equal number of steps

In the agent.py act() function you assume that all episodes end after a constant amount of steps-- when you call obs_t = np.reshape(obs_t, (-1, self.obs_dim))

Is this a standard practice or something that should change? My simulation's concept of 'done' differs on each run.

Thanks

Recommend Projects

takuseno / rsvg Goto Github PK

rsvg's Introduction

📖 About me

🚀 GitHub Projects

As an owner

As a contributor

rsvg's People

Contributors

Stargazers

Watchers

Forkers

rsvg's Issues

Error related to size dismatch when running `play.py`

Assumption that all episodes are of equal number of steps

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent