Great Repository! Is it within your scope to implement a webGPU acce

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Whisper in web-llm with WebGPU? about web-llm HOT 4 OPEN

mlc-ai commented on May 29, 2024

Whisper in web-llm with WebGPU?

from web-llm.

Comments (4)

tqchen commented on May 29, 2024 6

great suggestion, yes this is something that we can push for

from web-llm.

sandorkonya commented on May 29, 2024

@tqchen my ultimate goal would be to get it run the most efficient way on android edge device.

Although there is already a solution in the onnx framework onnx framework, based on the recent merge, but i am not sure when it will be usable on android.

There were some who tried with GPU delegates, but no success yet.

Any idea how one could solve it on the edge (Android) device?

from web-llm.

DustinBrett commented on May 29, 2024

There is also a demo of Whisper running via WebAssembly in that repo. https://github.com/ggerganov/whisper.cpp/tree/master/examples/talk.wasm

from web-llm.

sandorkonya commented on May 29, 2024

There is also a demo of Whisper running via WebAssembly in that repo. https://github.com/ggerganov/whisper.cpp/tree/master/examples/talk.wasm

Yes, it runs on CPU. I hope, that with a GPU version one could reach real time inference.

from web-llm.

Whisper in web-llm with WebGPU? about web-llm HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent