Comments (2)
Hi there,
Good question! The observation does include the robot's joint angles (for controllable actuators), which is stored in the variable robot_right_joint_positions
in this case above. As for the other elements in the observation, they help the robot infer the state of the world a bit easier. Technically all the robot needs to know about is its joint angles and a target goal, but then that requires the control model (in this case a neural network) to also learn how to do forward kinematics to compute how far the robot's end effector is from some goal. Instead, we just directly compute the forward kinematics to get the robot gripper position/orientation and provide that to the model to make learning a bit easier. For a quick validation of this, you could train simple controllers with and without this information in the observation and check how long it takes for them to learn the task.
self.target_pos
is important to give the robot as an observation since that its the target goal for the robot to reach towards. If that information is not given to the robot, then the robot has no idea where it should move towards. Of course the reward will also be based on this target goal, but the robot only gets 'rewards' during training in simulation, not during test time. In practice, it is quite common for robots to know where their target goal is (otherwise they would just be moving around randomly!). For example, if helping someone eat food, the robot should know where the person's mouth is at each time step, which can be provided by off-the-shelf face detection software.
Hope this helps!
from assistive-gym.
Thanks for the clarifications! I actually realised that Mujoco's reacher environment also uses the same forward kinematic trick.
And yes obviously, the robot must know the target position to be able to learn...
Thanks!
from assistive-gym.
Related Issues (20)
- Error with building custom bullet3 HOT 2
- Strange issue with training cooperative scratch environment HOT 3
- Reproducing Cooperative ItchScratch results HOT 2
- Issue with creating a new environment HOT 2
- Arm Manipulation Google Collaboratory HOT 3
- ValueError when using SAC with co-optimization HOT 1
- About p.setGravity HOT 1
- Issue Create New Environment HOT 1
- Transfer learned policy to actual robot
- Action space for FeedingSawyer HOT 3
- Import Custom 3D Objects HOT 2
- Run trained policies for active human environments on static human environments HOT 2
- Can't load "human" environments HOT 1
- Hi, I can't train ppo on tutorial HOT 1
- Hi, i can't run the env_viewer.py HOT 2
- Add a new type of robot HOT 2
- ('Observation ({}) outside given space ({})!' Error when trying to train model HOT 4
- When I do "env = gym.make", I get an error "PicklingError: Could not pickle object as excessively deep recursion required. HOT 2
- Issue running Assistive_Gym_Basics.ipynb colab notebook HOT 2
- Unable to build Dockerfile
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from assistive-gym.