Git Product home page Git Product logo

o's Introduction

o

o stands for Zero-Shot Autonomous Robots.

o

This repo uses model APIs to create a Zero-Shot Autonomous Robot. Individual robot behaviors are wrapped in asynchronous nodes (python) which are launched via scripts (bash). It's kind of like a more minimalist and simpler ROS. Four main types of models are used:

  • LLM (Language Language Model) a text2text model used for planning, reasoning, dialogue, and more!
  • VLM (Vision Language Model) - a image2text model used for scene understanding, object detection, and more!
  • TTS (Text-to-Speech) a text2audio model used for speech synthesis so the robot can talk.
  • STT (Speech-to-Text) a audio2text model used for speech recognition so the robot can listen.

To get started follow the setup guide.

The models module contains code for different model apis. For example models/rep.py is for the open source Replicate API, and models/gpt.py is for the OpenAI API. More info on models.

The robots module contains code for different robots. For example robots/nex.py is for the HiWonder AiNex Humanoid. More info on robots.

The nodes module contains code for different nodes. For example nodes/look.py contains the loop used vision with a Vision Language Model. More info on nodes.

The params module contains code for different parameters. For example params/default.sh will load environment variables (params) that contain default values. More info on params.

If you are interested in contributing, please read the contributing guide.

Video

YouTube Video

Citation

@misc{zero-shot-robot-2023,
  title={Zero-Shot Autonomous Robots},
  author={Hugo Ponte},
  year={2023},
  url={https://github.com/hu-po/o}
}

o's People

Contributors

hu-po avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.