jianing-sun / interpolated-policy-gradient-with-ppo-for-robotics-control- Goto Github PK
View Code? Open in Web Editor NEWReinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)