speaker_recognition_GMM_UBM

A speaker recognition system based on GMM-UBM (a Gaussian mixture model with a universal background model), built for an Android application that helps monitor patients with schizophrenia.

Installing dependencies

To install all the dependencies for this project, run the following command:

pip3 install -r requirements.txt

Extracting MFCCs from audio


To extract MFCCs (mel-frequency cepstral coefficients) from audio samples, put all the audio files in a separate folder and run the following command:

python3 src/speaker_recognition/extract_mfcc_coefficients.py \
    --audio_folder <path to the folder which contains audio> \
    --csv_file_name <name of the csv file that will be created> \
    --opt combined
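
For readers unfamiliar with the feature itself, the following is a minimal NumPy sketch of how MFCCs are commonly computed (framing, Hamming window, power spectrum, mel filterbank, log, DCT-II). It is not taken from extract_mfcc_coefficients.py. The function names and every parameter value here (16 kHz audio, 25 ms frames, 10 ms hop, 26 filters, 13 coefficients) are assumptions chosen for illustration.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale; shape (n_filters, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    bank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        left, centre, right = bins[i], bins[i + 1], bins[i + 2]
        for k in range(left, centre):      # rising edge of the triangle
            bank[i, k] = (k - left) / (centre - left)
        for k in range(centre, right):     # falling edge, peak of 1 at the centre bin
            bank[i, k] = (right - k) / (right - centre)
    return bank

def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_ceps=13):
    """Return an array of shape (n_frames, n_ceps). Assumed defaults, not the repo's."""
    # Slice the signal into overlapping frames and apply a Hamming window
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    # Power spectrum of each (zero-padded) frame
    power = np.abs(np.fft.rfft(frames, n=n_fft)) ** 2 / n_fft
    # Log mel-band energies; the small constant avoids log(0)
    energies = np.log(power @ mel_filterbank(n_filters, n_fft, sample_rate).T + 1e-10)
    # Unnormalised DCT-II over the filter axis, keeping the first n_ceps coefficients
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), np.arange(n_filters) + 0.5) / n_filters)
    return energies @ basis.T

# Usage on a synthetic one-second 440 Hz tone
t = np.arange(16000) / 16000.0
features = mfcc(np.sin(2.0 * np.pi * 440.0 * t))
print(features.shape)
```

A feature matrix like this could be written out with np.savetxt(path, features, delimiter=","), although the CSV layout that the project's script actually produces is not documented here.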

Creating Universal Background Model


To train the UBM, run the following command:

python3 src/speaker_recognition/speaker_recognition.py \
    --csv_file <path to MFCC coefficients file> \
    --operation ubm
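
As background, a UBM is typically a Gaussian mixture model fitted with expectation-maximisation (EM) to MFCC frames pooled from many speakers. The sketch below shows one plain EM loop for a diagonal-covariance GMM in NumPy. It is an illustration under stated assumptions, not the code in speaker_recognition.py: the function names, the random initialisation, the component count, and the variance floor are all hypothetical choices.

```python
import numpy as np

def component_log_probs(x, weights, means, variances):
    """log(w_k * N(x_t | mu_k, diag(var_k))) for every frame t and component k."""
    diff = x[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    return np.log(weights) + log_gauss

def avg_log_likelihood(x, weights, means, variances):
    """Mean per-frame log-likelihood of x under the mixture."""
    log_p = component_log_probs(x, weights, means, variances)
    return float(np.mean(np.logaddexp.reduce(log_p, axis=1)))

def train_ubm(x, n_components=8, n_iter=50, var_floor=1e-3, seed=0):
    """Fit a diagonal-covariance GMM to frames x of shape (n_frames, dim) with EM."""
    rng = np.random.default_rng(seed)
    n_frames = x.shape[0]
    # Assumed initialisation: random frames as means, global variance for every component
    means = x[rng.choice(n_frames, n_components, replace=False)].copy()
    variances = np.tile(x.var(axis=0) + var_floor, (n_components, 1))
    weights = np.full(n_components, 1.0 / n_components)
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each frame
        log_p = component_log_probs(x, weights, means, variances)
        gamma = np.exp(log_p - np.logaddexp.reduce(log_p, axis=1, keepdims=True))
        # M-step: re-estimate weights, means and variances from the soft counts
        n_k = gamma.sum(axis=0)
        safe_n_k = np.maximum(n_k, 1e-10)[:, None]
        weights = n_k / n_frames
        means = gamma.T @ x / safe_n_k
        variances = np.maximum(gamma.T @ (x ** 2) / safe_n_k - means ** 2, var_floor)
    return weights, means, variances

# Usage on synthetic two-cluster data standing in for pooled MFCC frames
rng = np.random.default_rng(1)
frames = np.vstack([rng.normal(-5.0, 1.0, (200, 2)), rng.normal(5.0, 1.0, (200, 2))])
ubm_weights, ubm_means, ubm_variances = train_ubm(frames, n_components=2, n_iter=20)
print(avg_log_likelihood(frames, ubm_weights, ubm_means, ubm_variances))
```

Diagonal covariances are a common choice for MFCC features because they keep the parameter count low. Whether the project uses them is not stated in this README.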

MAP adaptation using the created GMM-UBM model


To run MAP adaptation:

python3 src/speaker_recognition/speaker_recognition.py \
    --csv_file <path to MFCC coefficients file> \
    --operation map \
    --ubm_file <path to the ubm file created after GMM-UBM model creation>
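
To make the adaptation step concrete, here is a small sketch of the widely used mean-only MAP update for GMM-UBM systems: each UBM mean is pulled towards the speaker's data in proportion to how many frames that component explains. This is an assumption about the general technique, not a description of what speaker_recognition.py does. The function name and the relevance factor of 16 are illustrative.

```python
import numpy as np

def map_adapt_means(x, weights, means, variances, relevance=16.0):
    """Return speaker-adapted means; weights and variances stay those of the UBM."""
    # Posterior probability of each UBM component for each speaker frame
    diff = x[:, None, :] - means[None, :, :]
    log_p = np.log(weights) - 0.5 * (np.sum(diff ** 2 / variances, axis=2)
                                     + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    gamma = np.exp(log_p - np.logaddexp.reduce(log_p, axis=1, keepdims=True))
    # Soft frame count and data mean per component
    n_k = gamma.sum(axis=0)
    data_means = gamma.T @ x / np.maximum(n_k, 1e-10)[:, None]
    # Components that saw more speaker data move further from the UBM mean
    alpha = (n_k / (n_k + relevance))[:, None]
    return alpha * data_means + (1.0 - alpha) * means

# Usage with a toy two-component UBM and 16 identical speaker frames
ubm_weights = np.array([0.5, 0.5])
ubm_means = np.array([[-2.0, 0.0], [2.0, 0.0]])
ubm_variances = np.ones((2, 2))
speaker_frames = np.tile([3.0, 1.0], (16, 1))
print(map_adapt_means(speaker_frames, ubm_weights, ubm_means, ubm_variances))
```

In this toy case the second component receives nearly all 16 frames, so with a relevance factor of 16 its mean moves about halfway towards the speaker data, while the first component stays almost unchanged.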

To test the MAP-adapted model:

python3 src/speaker_recognition/testing_model.py \
    --map_file_name <path to map adapted .npy file> \
    --ubm_file_name <path to ubm .npy file> \
    --test_csv_file <path to the csv file of test speaker> \
    --N 1500
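
GMM-UBM systems are usually scored with an average log-likelihood ratio between the adapted speaker model and the UBM. The sketch below illustrates that idea only. It makes no claim about how testing_model.py computes its result or what its flags control, and the function names and the zero decision threshold are assumptions.

```python
import numpy as np

def avg_log_likelihood(x, weights, means, variances):
    """Mean per-frame log-likelihood of x under a diagonal-covariance GMM."""
    diff = x[:, None, :] - means[None, :, :]
    log_p = np.log(weights) - 0.5 * (np.sum(diff ** 2 / variances, axis=2)
                                     + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    return float(np.mean(np.logaddexp.reduce(log_p, axis=1)))

def llr_score(x, weights, ubm_means, speaker_means, variances):
    """Positive values favour the adapted speaker model, negative values the UBM."""
    return (avg_log_likelihood(x, weights, speaker_means, variances)
            - avg_log_likelihood(x, weights, ubm_means, variances))

# Usage with toy models: the speaker model differs from the UBM in one mean only
weights = np.array([0.5, 0.5])
variances = np.ones((2, 2))
ubm_means = np.array([[-2.0, 0.0], [2.0, 0.0]])
speaker_means = np.array([[-2.0, 0.0], [2.5, 0.5]])

target_frames = np.tile([2.5, 0.5], (10, 1))    # frames matching the adapted mean
impostor_frames = np.tile([2.0, 0.0], (10, 1))  # frames matching the UBM mean

threshold = 0.0  # assumed; in practice a threshold is tuned on held-out data
print(llr_score(target_frames, weights, ubm_means, speaker_means, variances) > threshold)
print(llr_score(impostor_frames, weights, ubm_means, speaker_means, variances) > threshold)
```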

Contributors

scelesticsiva
