Git Product home page Git Product logo

Comments (8)

BadDeveloper2022 avatar BadDeveloper2022 commented on July 16, 2024

已成功在1.6.20上编译成功。VAD我看你用了简单的VAD算法,能不能用webrtc或者其他好些的算法。

from freeswitch-asr.

BadDeveloper2022 avatar BadDeveloper2022 commented on July 16, 2024

还有一个问题:多线程。
谢谢代码的分享。

from freeswitch-asr.

lionsoul2014 avatar lionsoul2014 commented on July 16, 2024

VAD算法实现都差不多,这个算法我测试过还可以,自己调整其中的一些阈值可以适应不同的应用场景。Freeswitch里面,每次接入模块都会调用open,open里面会创建一个单独的对象,本身适合多线程环境。

from freeswitch-asr.

lionsoul2014 avatar lionsoul2014 commented on July 16, 2024

本身自带的ASR接口是适应yuanyu.ai的,建议你改成自己的http restfull接口,更改下curl的参数就OK了。

from freeswitch-asr.

BadDeveloper2022 avatar BadDeveloper2022 commented on July 16, 2024

你的意思是 不需要单独的线程来处理是吧?我的意思是 http asr请求延迟,会不会影响FreeSwitch 音频处理,并发数高的时候,会不会出现问题。
http 接口已改成自己的了。
上次FreeSwitch群里有人提过这种线程的问题,所以问问的。谢谢你的回复

from freeswitch-asr.

BadDeveloper2022 avatar BadDeveloper2022 commented on July 16, 2024

比如 : 同步的话,http 接口延迟了,但是用户已经说两句话了,下次识别的时候,会不会出现中间的部分丢掉的情况,堵塞的情况。超时设置2000也许避免这样的问题,但是这个不是好的办法吧。你没遇到过这种问题吗。

from freeswitch-asr.

lionsoul2014 avatar lionsoul2014 commented on July 16, 2024

有这个情况的,电话语音交互,网络延时确实没啥好的解决办法,本身同一个通话线程,IO虽然是异步的,但是也不好支持异步处理。建议在交互上做些功夫,例如,在asr请求没有返回之前,可以语音提示正在处理中等,不过这需要深入的更改ASR和Freeswitch容器之间的通讯关系。

from freeswitch-asr.

BadDeveloper2022 avatar BadDeveloper2022 commented on July 16, 2024

我现在的解决办法是 每一次的语音识别 加一个标签,比如:0,1,2,3,4,5,6... 这样可以保证前后循序,处理程序 根据这个标签来 可以判断 第一句话是哪个,第二句话是哪个。
谢谢你的回答。我研究研究。

from freeswitch-asr.

Related Issues (4)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.