Git Product home page Git Product logo

dobb-e's Introduction

preview

Dobb·E

arXiv License Code Style: Black PyTorch

Project webpage · Documentation (gitbooks) · Paper

Authors: Mahi Shafiullah*, Anant Rai*, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

Open-source repository of the hardware and software components of Dobb·E and the associated paper, On Bringing Robots Home

dobb-e.mp4

Abstract

Throughout history, we have successfully integrated various machines into our homes - dishwashers, laundry machines, stand mixers, and robot vacuums are a few of the latest examples. However, these machines excel at performing a single task effectively. The concept of a “generalist machine” in homes - a domestic assistant that can adapt and learn from our needs, all while remaining cost-effective has long been a northstar in robotics that has been steadily pursued for decades. In this work, we initiate a large-scale effort towards this goal by introducing Dobb·E, an affordable yet versatile general-purpose system for learning robotic manipulation within household settings. Dobb·E can learn a new task with only five minutes of a user showing it how to, thanks to a demonstration collection tool (“The Stick”) we built out of cheap parts and iPhones. We use the Stick to collect 13 hours of data in 22 homes of New York City, and train Home Pretrained Representations (HPR). Then, in a novel home environment, with five minutes of demonstrations and fifteen minutes of adapting the HPR model, we show that Dobb·E can reliably solve the task on the Stretch, a mobile robot readily available in the market. Across roughly 30 days of experimentation in homes of New York City and surrounding areas, we test our system in 10 homes, with a total of 109 tasks in different environments, and finally achieve a success rate of 81%. Beyond success percentages, our experiments reveala plethora of unique challenges absent or ignored in lab-robotics, ranging fromeffects of strong shadows, to demonstration quality by non-expert users. With the hope of accelerating research on home robots, and eventually seeing robot butlers in every home, we open-source Dobb·E software stack and models, our data, and our hardware designs.

What's on this repo

Dobb·E is made out of four major components:

  1. A hardware tool, called The Stick, to comfortably collect robotic demonstrations in homes.
  2. A dataset, called Homes of New York (HoNY), with 1.5 million RGB-D frames. collected with the Stick across 22 homes and 216 environments of New York City.
  3. A pretrained lightweight foundational vision model called Home Pretrained Representations (HPR), trained on the HoNY dataset.
  4. Finally, the platform to tie it all together to deploy it in novel homes, where with only five minutes of training data and 15 minutes of fine-tuning HPR, Dobb·E can solve many simple household tasks.

Reflecting this structure, there are four folders in this repo, where:

  1. hardware contains our 3D printable STL files, as well as instructions on how to set up the Stick.
  2. stick-data-collection contains all the necessary software for processing any data you collect on the Stick.
  3. imitation-in-homes contains our code for training a policy on your collected data using our pretrained models, and also the code to pretrain a new model yourself.
  4. robot-server contains the code to be run on the robot to deploy the learned policies.

The primary documentation source is gitbooks at https://docs.dobb-e.com. There are also associated documentations inside each folder's READMEs.

Paper

paper_preview Get it from ArXiv or our website.

Citation

If you find any of our work useful, please cite us!

@misc{shafiullah2023dobbe,
      title={On Bringing Robots Home}, 
      author={Nur Muhammad Mahi Shafiullah and Anant Rai and Haritheja Etukuru and Yiqian Liu and Ishan Misra and Soumith Chintala and Lerrel Pinto},
      year={2023},
      eprint={2311.16098},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}

dobb-e's People

Contributors

notmahi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.