Git Product home page Git Product logo

ar2-d2's Introduction

AR2-D2: Training a Robot without a Robot

Jiafei Duan$^1$, Yi Ru Wang$^1$, Mohit Shridhar$^1$, Abhimanyu Saighal$^1$, Dieter Fox$^{1,2}$, , Ranjay Krishna$^{1,3}$

$^1$ University of Washington, $^2$ NVIDIA, $^3$ Allen Institute for AI

Project Page | Arxiv

If you find this codebase useful, consider citing:

@article{duan2023ar2,
  title={Ar2-d2: Training a robot without a robot},
  author={Duan, Jiafei and Wang, Yi Ru and Shridhar, Mohit and Fox, Dieter and Krishna, Ranjay},
  journal={arXiv preprint arXiv:2306.13818},
  year={2023}
}

🌟 AR2-D2: Training a Robot without a Robot 🌟

AR2-D2 is a robot demonstrations collection framework in the form of an iOS app that people can use to project an AR robot into the physical world and record a video of themselves manipulating any object whilst simultaneously capturing the essential data modalities for training a real robot

❓ If you have any questions, please contact me at duanj1 [at] cs [dot] washington [dot] edu. ❓

AR2-D2 APP Download here

Installation instruction here

πŸ”¨ AR2-D2 IOS APP Setup

[Click to view]
1. Download all the folders in the  AR2-D2 APP
2. Follow the step-by-step instructions for installation.

AR2-D2 utils

  1. We have provided two interactive notebooks in the utilities folder - one to generate training data from the app's raw output and one to visualise a particular scene in voxel form
  2. To use the data generation notebook, first organise the app's raw output (i.e., the depth images, rgb images, and text) into folders by episode. There should be one top-level directory and several sub-folders, each corresponding to a given episode, like so:
    SceneName
    |
    |__ episode1
    |    |__ (rgb + depth + text files)
    |
    |__ episode2
        ...
    
    Note that the sub-folders must follow the naming convention episode{EPISODE_NUMBER}, where {EPISODE_ NUMBER} starts from 1
  3. Once you have generated the training data using the first notebook, you can visualise each episode in voxel form using the second notebook. Simply specify the name of the directory containing the training data and the episode number.

Acknowledgements

We thank the members of the Robotics State Estimation lab and Krishna’s group for the helpful discussions and feedback on the paper. Jiafei Duan is supported by the National Science Scholarship from The Agency for Science, Technology and Research (A*STAR), Singapore.

Coming soon...

[Click to view]
  • Docker training pipeline
  • PerAct real-world implementation

ar2-d2's People

Contributors

jiafei1224 avatar amyu14 avatar

Stargazers

Freax Ruby avatar Yu Lei avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.