edge-whisper

https://edge-ai.yomo.run

Real-time transcription by running the Whisper model on a geo-distributed cloud.

This showcase demonstrates real-time speech-to-text transcription using the Whisper model. The model is deployed across geographically distributed cloud infrastructure to ensure optimal performance and low latency for users around the world.

Users are automatically directed to the most suitable backend server based on their location. To determine your assigned backend and hardware configuration, simply ping edgeai.yomo.dev and check the returned IP address. Here's an overview of the available backends:

  • 3.66.190.18: runs Whisper.cpp inference on an AWS Graviton 3 Arm-based processor.
  • 20.9.141.176: runs Whisper.cpp inference on an Azure Ampere Arm-based processor.
  • 43.131.34.253: runs Whisper inference on an NVIDIA Tesla T4 GPU.
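
As a rough illustration of the lookup described above, the sketch below resolves the hostname through DNS (instead of sending an ICMP ping) and matches the returned address against the backend list. The function names `describe_backend` and `assigned_backend` are hypothetical and not part of this project, and the table is only a copy of the list above, so it may go out of date if the deployment changes.

```python
import socket

# Backend table copied from the list above (IP -> hardware description).
# Assumption: these addresses are still current; update them if the list changes.
BACKENDS = {
    "3.66.190.18": "Whisper.cpp on AWS Graviton 3 (Arm)",
    "20.9.141.176": "Whisper.cpp on Azure Ampere (Arm)",
    "43.131.34.253": "Whisper on NVIDIA Tesla T4",
}


def describe_backend(ip: str) -> str:
    """Return the hardware description for a backend IP, or a fallback string."""
    return BACKENDS.get(ip, f"unknown backend ({ip})")


def assigned_backend(hostname: str = "edgeai.yomo.dev") -> str:
    """Resolve the hostname via DNS and describe the backend it points to.

    Raises OSError (socket.gaierror) if the name cannot be resolved.
    """
    ip = socket.gethostbyname(hostname)
    return f"{ip}: {describe_backend(ip)}"
```

Calling `assigned_backend()` performs a real DNS lookup, so the result depends on where the call is made from; `describe_backend` can be used on its own with an address obtained from ping.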

By leveraging this geographically distributed architecture, this showcase delivers fast, accurate, and reliable speech transcription for users globally.

Self-hosting

To deploy this real-time speech transcription system on your own infrastructure, follow these steps:

  1. Start the frontend: Run pnpm run dev to launch the frontend application, which provides the interface for simultaneous interpretation.
  2. Choose your backend: Backends are located in the ./backends/ directory and are built using YoMo. Each backend targets a specific type of AI infrastructure.
  3. Select and run the appropriate backend script:

Please note: These instructions assume you have the necessary dependencies installed, such as Whisper, Whisper.cpp, and the YoMo framework. Refer to the project documentation for further details.

Development on Arm dev machine

Follow the instructions to run this demo on a development machine with an Arm-based processor.

Contributors

fanweixiao
