Git Product home page Git Product logo

bing-gpt-voice-assistant's Introduction

Bing-GPT-Voice-Assistant

This is a Python voice assistant that uses both OpenAI's 3.5 turbo model and Bing's model with access to the internet for searchable results. All speech-to-text transcription is done locally using OpenAI's Whisper running on the local machine, and text-to-speech is done with AWS Polly.

Note: You will need to have access to the Bing AI search function to utilize this feature.

Table of Contents

Setup

The setup for this project is relatively simple and has a detailed youtube tutorial, which briefly explains how the program works as well as providing a detailed installation guide.

Demonstration

demonstration

Install and Setup

  1. install ffmpeg from their website, or by running sudo apt update && sudo apt install ffmpeg on Linux

  2. Click the green Code button then Download ZIP. After downloading, unzip the file and copy the path to the folder that has the main.py file in it. Alternatively run git clone https://github.com/Ai-Austin/Bing-GPT-Voice-Assistant.git if you have installed git on your device,

  3. To set up the AI's for response generation, use the steps outlined from the video to setup the cookies.json file and follow the steps for creating an OpenAI key and putting it in the program where prompted.

  4. Install python if it is not already on your system. Scroll down to files and download and install the version compatible for your computer. Windows x86-64 executable installer should work for most windows users (note: The link is to the 3.10.0 version, the latest release 3.11 is not compatible with Whisper)

  5. Open a new terminal window and run the command cd path\you\copied using the path from step 1. Then run pip install -r requirements.txt to install the required packages

Running the Assistant

Simply open a terminal and move to the folder with the code cd path\you\copied or use the terminal from step 5. Then run python main.py to start the assitant. To exit the assistant, press ctrl+c to stop the program (you may need to press it twice if it doesn't shut down).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.