[NeurIPS 2022] DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body

Home Page: https://amathislab.github.io/dmap/

License: MIT License

DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body

We introduce DMAP, a biologically inspired, attention-based policy network architecture. DMAP combines independent proprioceptive processing, a distributed policy with individual controllers for each joint, and an attention mechanism that dynamically gates sensory information from different body parts to different controllers (see diagram below). Despite not having access to the (hidden) morphology information, DMAP can be trained end-to-end in all the considered environments, overall matching or surpassing the performance of an oracle agent. Thus DMAP, by implementing principles from biological motor control, provides a strong inductive bias for learning challenging sensorimotor tasks. Overall, our work corroborates the power of these principles in challenging locomotion tasks.
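The three ingredients named above (per-joint proprioceptive encoding, attention-based gating, and one controller per joint) can be illustrated with a toy forward pass. This is a minimal sketch in pure Python and not the implementation in this repository: the function names, the dimensions, the random weights, and the choice to compute attention scores from the current embeddings with a per-controller scoring vector are all assumptions made for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Apply a weight matrix (list of rows) to a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def random_matrix(rows, cols, rng):
    """Hypothetical random weights standing in for trained parameters."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def distributed_attention_policy(joint_obs, encoders, score_weights, controllers):
    """Toy forward pass: encode each joint, gate with attention, act per joint."""
    # 1) Independent proprioceptive processing: each joint's observation
    #    is embedded by its own encoder.
    embeddings = [
        [math.tanh(v) for v in linear(enc, obs)]
        for enc, obs in zip(encoders, joint_obs)
    ]
    emb_dim = len(embeddings[0])

    actions, attention = [], []
    for score_w, ctrl_w in zip(score_weights, controllers):
        # 2) Attention: this controller scores every body part's embedding
        #    and turns the scores into gating weights that sum to 1.
        scores = [sum(w * e for w, e in zip(score_w, emb)) for emb in embeddings]
        gate = softmax(scores)
        # Gated mixture of sensory embeddings routed to this controller.
        mixed = [
            sum(g * emb[d] for g, emb in zip(gate, embeddings))
            for d in range(emb_dim)
        ]
        # 3) Distributed policy: one small controller outputs one joint action.
        actions.append(math.tanh(sum(w * m for w, m in zip(ctrl_w, mixed))))
        attention.append(gate)
    return actions, attention


# Assumed toy sizes: 4 joints, 3 sensory values per joint, 5-dim embeddings.
rng = random.Random(0)
n_joints, obs_dim, emb_dim = 4, 3, 5
joint_obs = [[rng.uniform(-1.0, 1.0) for _ in range(obs_dim)] for _ in range(n_joints)]
encoders = [random_matrix(emb_dim, obs_dim, rng) for _ in range(n_joints)]
score_weights = random_matrix(n_joints, emb_dim, rng)
controllers = random_matrix(n_joints, emb_dim, rng)

actions, attention = distributed_attention_policy(
    joint_obs, encoders, score_weights, controllers
)
```

Each row of `attention` shows how strongly one joint controller weights the sensory embedding of every body part. In this sketch a change to one joint's sensors reaches the other controllers only through these gating weights.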

This repository presents the code for DMAP.

Check out our website for more details and videos: https://amathislab.github.io/dmap/

Notes:

  • Dec 2022: Check out this EPFL news article on our work! Also available in French.
  • Nov 2022: For testing our agents, we use a slightly adapted version of the Pybullet-M benchmark. Currently this benchmark code is integrated in our repository, but it will shortly be released as a stand-alone repository (stay tuned). For more information on the benchmark as well as baselines, check out:

    Mann, K. S., Schneider, S., Chiappa, A., Lee, J. H., Bethge, M., Mathis, A., & Mathis, M. W. (2021, March). Out-of-distribution generalization of internal models is correlated with reward. In Self-Supervision for Reinforcement Learning Workshop-ICLR 2021.

  • Sept 2022: Paper accepted at NeurIPS 2022!

Example body morphologies and agent locomotion performance

[Images: Example 1, Example 2; each links to a YouTube video.]

Installation and reproducing results

Reproducing the results of the paper

Reference

Chiappa, A.S., Vargas, A.M. and Mathis, A., 2022. DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body. arXiv preprint arXiv:2209.14218.

@article{chiappa2022dmap,
  title={DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body},
  author={Chiappa, Alberto Silvio and Vargas, Alessandro Marin and Mathis, Alexander},
  journal={arXiv preprint arXiv:2209.14218},
  year={2022}
}
