convo - Enterprise grade voicebot for Humanoid Robots and virtual digital humans

convo works locally and it's free

convo brings together silero and rasa to create a continuous spoken-conversation experience, similar to Alexa or Google smart speakers.

silero STT and TTS models provide quality comparable to Google's STT (and sometimes better) without relying on Google. See silero performance benchmarks.
rasa is an enterprise-grade chatbot framework built on Python and Transformer-based language models; it is comparable to, or better than, the top cloud-based chatbot frameworks.

convo runs easily on a local CPU-only machine, so it provides fast response times with no cloud service costs.

Typical STT and TTS inference times on a local machine are less than 0.5 seconds each for one sentence, and rasa bot response time is around 1-2 seconds. These times can be improved further by fine-tuning and by using dedicated machines.

The time the user takes to speak and the time convo takes to play the response audio come on top of this. They do not count as processing time for convo or any other voicebot.
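To check these figures on your own machine, each stage of a turn can be timed separately. The following is a minimal sketch, not code from convo: `time_stage` and `run_turn` are hypothetical helpers, and the `stt`, `bot`, and `tts` arguments stand in for whatever callables your setup uses for speech-to-text, the rasa call, and text-to-speech.

```python
import time


def time_stage(fn, *args):
    """Run fn(*args) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


def run_turn(stt, bot, tts, audio):
    """Run one conversational turn and collect per-stage processing times.

    stt, bot and tts are placeholder callables (assumptions, not convo's API).
    Speaking time and audio playback time are deliberately not measured.
    """
    timings = {}
    text, timings["stt"] = time_stage(stt, audio)
    reply, timings["bot"] = time_stage(bot, text)
    speech, timings["tts"] = time_stage(tts, reply)
    timings["total"] = timings["stt"] + timings["bot"] + timings["tts"]
    return speech, timings
```

With the numbers quoted above (under 0.5 s for STT, 1-2 s for rasa, under 0.5 s for TTS), `timings["total"]` would be expected to stay below roughly 3 seconds per turn.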

convo advantages:

High performance, as the framework can run locally on a CPU
No cloud charges, so it can be deployed for the masses
Highly customizable: a rasa custom action server can add any desired functionality
Supports multiple languages, as supported by silero models

convo does not use any hotword detection mechanism. However, it stops speaking when the speaker says a keyword such as stop, quit, or exit.
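A keyword check of this kind can be sketched in a few lines. This is an illustration only: the function name `is_stop_request` and the whole-word matching rule are assumptions, and convo's own check may differ (for example, it may require the keyword to be the entire utterance).

```python
import re

# Keywords taken from the description above.
STOP_WORDS = {"stop", "quit", "exit"}


def is_stop_request(transcript):
    """Return True if the transcribed text contains a stop keyword as a whole word."""
    words = re.findall(r"[a-z']+", transcript.lower())
    return any(word in STOP_WORDS for word in words)
```

Matching on whole words avoids false positives on words such as "unstoppable", at the cost of also stopping on sentences like "do not stop".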

Installation and Basics

Two base frameworks need to be installed to set up convo:

rasa
silero

rasa Installation steps

Create a Python virtual environment named "rasa" with a Python version supported by rasa, as listed in the rasa installation guide. The current rasa version 3.x requires Python 3.7 or 3.8. Activate the rasa virtual environment before following the installation steps below.
Install rasa using the commands below.

 pip install --upgrade pip
 
 pip install rasa

Run "rasa init" in the terminal and follow the on-screen instructions to create a rasa chatbot instance.

 rasa init

Once the rasa chatbot instance is created, you can check that it works by running rasa shell, which lets you talk to your assistant on the command line. Note: change to your rasa instance directory before running the command below.

 rasa shell

This runs the rasa server and lets you chat with it in the terminal. Enter "/stop" to stop the rasa server.

convo calls this rasa chatbot through a REST API call. Whenever convo needs to communicate with the rasa chatbot, start rasa using

 rasa run --enable-api
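Once the server is running, a message can be sent to it over HTTP. The sketch below assumes rasa's default port 5005 and that the REST channel is enabled in the bot's credentials.yml; the helper names (`build_payload`, `extract_replies`, `ask_rasa`) and the sender id "convo-user" are illustrative and not taken from convo.py.

```python
import json
import urllib.request

# Default REST channel endpoint of a locally running rasa server (assumption: port 5005).
RASA_URL = "http://localhost:5005/webhooks/rest/webhook"


def build_payload(message, sender="convo-user"):
    """Encode one user message in the JSON shape the REST channel expects."""
    return json.dumps({"sender": sender, "message": message}).encode("utf-8")


def extract_replies(response_body):
    """Return the text of every bot message in a REST channel response."""
    messages = json.loads(response_body)
    return [m["text"] for m in messages if "text" in m]


def ask_rasa(message, sender="convo-user", url=RASA_URL, timeout=10):
    """POST a message to rasa and return the list of text replies."""
    request = urllib.request.Request(
        url,
        data=build_payload(message, sender),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return extract_replies(response.read())
```

For example, `ask_rasa("hello")` returns a list of reply strings, which can then be passed to the TTS stage one by one.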

silero Installation steps

Create a Python virtual environment named "silero" with the latest Python version.
silero has quite a few dependencies; the following steps go through them.
Install PyTorch using the instructions at https://pytorch.org/get-started/locally/. On Windows with CPU only, the command may look like the one below. Use the appropriate command for your machine configuration.

pip install torch torchvision torchaudio

Additionally, the following Python packages are needed:

pip install PySoundFile SpeechRecognition omegaconf

SpeechRecognition is a wrapper library that performs speech recognition through multiple ASR services, including Google Cloud Speech. convo uses this library only to capture and record audio, since it detects voice activity and ends the mic recording when the user stops speaking.
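The capture step can be sketched as follows. This is a minimal illustration rather than the code in convo.py: `capture_utterance` and `main` are hypothetical names, and the 16 kHz sample rate and the output file name are assumptions. Running `main()` requires SpeechRecognition and PyAudio to be installed and a working microphone.

```python
def capture_utterance(recognizer, source, wav_path):
    """Record one utterance from an open audio source and save it as a WAV file.

    listen() returns once the library detects that the speaker has paused.
    """
    recognizer.adjust_for_ambient_noise(source, duration=0.5)
    audio = recognizer.listen(source)
    with open(wav_path, "wb") as wav_file:
        wav_file.write(audio.get_wav_data())
    return wav_path


def main():
    # Imported here so the helper above can be used without a microphone.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    # 16 kHz is an assumed sample rate; adjust it to what your STT model expects.
    with sr.Microphone(sample_rate=16000) as source:
        print("Saved", capture_utterance(recognizer, source, "utterance.wav"))
```

No recognition service is called here; the library is used only for microphone access and end-of-speech detection, and the saved audio is then handed to the local STT model.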

PyAudio also needs to be installed. On Windows 10 or 11 you may encounter an error when installing PyAudio; in that case, use the following commands to install it.

pip install pipwin 
    
pipwin install pyaudio

convo uses imports from silero that are already included in this repo. Please check and ensure that the silero model and utils are in the right place.

With that, the installation steps are done. Now try running convo.py in the terminal using the silero virtual environment, and you should be able to speak with your computer 👍

python convo.py

When you run convo for the first time, it downloads the silero models to a cache. Download progress is displayed in the terminal. Subsequent runs use the locally cached models, which is fast.

Troubleshooting tips

If you are not able to speak with your computer, check the points below.

Check that your mic and speaker are enabled. On Windows you may also need to check permissions.
Check that all the libraries mentioned above are installed properly and that silero and rasa each run in their own virtual environment.
Check that you are running the rasa server from inside the rasa bot directory using "rasa run --enable-api" and that it reported the rasa server is up and running.
Standard laptop mics are often of limited quality, which may hurt speech recognition. Try raising the mic input volume or using a better-quality mic.
If you hit compatibility errors while installing into the virtual environments, you may want to delete and recreate them.
Available libraries and their compatibility may change in the future, so check for those kinds of issues as well.
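The rasa server check can also be done from Python before starting convo. The sketch below assumes the default address http://localhost:5005, whose root endpoint answers with a short greeting when the server is running; `rasa_is_up` is a hypothetical helper and not part of convo.

```python
import urllib.request


def rasa_is_up(base_url="http://localhost:5005", timeout=2):
    """Return True if a rasa server answers at base_url, False otherwise."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers connection refused, timeouts and HTTP errors.
        return False
```

If `rasa_is_up()` returns False, start the server with "rasa run --enable-api" from inside the rasa bot directory and try again.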

Citations

This repo builds on top of two great pieces of software: rasa and silero.

Future enhancements

This repo presents the base working implementation of convo. It can be enhanced further in many ways; some possible enhancements are listed below.

Add more functionality to rasa, such as chitchat, FAQ, and custom API calls
Add more languages and speakers
Perform in-memory processing of audio

Feedback and References

If you face any issues or have suggestions, please mention them in the issues section.
If you liked this repo or were able to use it in your work, please consider starring it and mentioning it in your citations.

ashutoshdongare / convo
