manantomar / agent-centric-representations Goto Github PK
View Code? Open in Web Editor NEWCode for "Principled Offline RL in the Presence of Rich Exogenous Information" and "Ignorance is Bliss: Robust Control via Information Gating".
Code for "Principled Offline RL in the Presence of Rich Exogenous Information" and "Ignorance is Bliss: Robust Control via Information Gating".
Eq. 2 in the paper states that the observation x
is merged with random gaussian noise epsilon
, but at line 123 in the infogating file https://github.com/manantomar/agent-centric-representations/blob/main/infogating.py#L123, it seems that the observation is mixed with an all zeros mask.
Combined with the random noise (which is actually dropped after 1000 steps in the UNet), I believe this is instead equivalent to ig(x) + clip(N(ig(x), 0.5), ig(x) - 0.3, ig(x) + 0.3)x
.
Can you help explain the discrepancy? Which version is correct, or am I misunderstanding the code?
Thanks!
Hi,
I am wondering what is the motivation behind using InfoNCE for InfoGating as opposed to vanilla multistep inverse dynamics predictor as in ACRO. Also, did you do comparison between InfoNCE and vanilla predictor?
Thanks for the time!
Hello, in the dmc.make
function, you have distracting_wrapper
(as well as fb_mtenv_dmc
) which is not in your codebase. Where can I get this?
Thank you for the time
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.