Git Product home page Git Product logo

funasr-online-paraformer-test's Introduction

用 FunASR-ONNX 加载 Paraformer 流式模型,实现的低延迟实时语音识别、桌面实时字幕。

实时语音识别

下载模型:

git clone https://www.modelscope.cn/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online-onnx.git model

安装依赖(Python3.10+):

pip install -r requirements.txt

使用:确保有麦克风,直接运行脚本 01 从麦克风流式转录.py 即可,在转录的同时,它也会用 UDP 向端口 6009 发送识别结果

若要更改端口,请修改源文件

桌面实时字幕

另外做了一个脚本 03 桌面悬浮字幕.py ,直接运行后,它会从 6009 端口接收 UDP 数据,实时更新在悬浮窗,以此来实现屏幕实时字幕

若要更改端口,请修改源文件

编辑 style.css 可以改变字幕的显示效果,包括字体大小、颜色、背景等

右键拖盘图标,可以:

  • 更新外观,在编辑 style.css 后使用
  • 隐藏、显示悬浮窗
  • 锁定、解锁悬浮窗;当解锁时,可以用鼠标拖动;当锁定时,无法拖动,鼠标事件会穿透
  • 退出

桌面实时字幕显示效果

funasr-online-paraformer-test's People

Contributors

haujetzhao avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.