Git Product home page Git Product logo

distributed-speaker-diarization-system's Introduction

Distributed Speaker Diarization System

wercker status Build Status Coverage Status

PJIIT Engineer's Thesis

Working application: http://whowhen.piotrsukiennik.pl

Continuous Integration: http://jenkins.piotrsukiennik.pl Sonar: http://sonar.piotrsukiennik.pl Redmine: http://redmine.piotrsukiennik.pl

Speaker diarization is a problem in computer science of segmenting the recorded speech into fragments corresponding to speakers that were present in that recording.

Using the technologies of web applications, the high performance clustering algorithm and distributing the system, made it possible for unlimited number of users to access the functionalities provided and getting high performance results in multiple parallel requests handling.

Project is divided into maven modules that correspond to distributed web applications responsible for different functionalities in the system.

System workflow:

  • Receiving audio recording from the user in form of an audio file
  • Audio file conversion into one that is supported by the system
  • Speech features extraction from the speech signal to the spectral data with use of MFCC
  • Automatic speaker clustering using the kNN algorithm
  • Splitting the audio file into the segments
  • Sharing the result audio files for users to access and sending an email notification

Project has been developed using following technologies: Maven, Java EE, Spring MVC, Memcached, FFMPEG, Comirva, Weka, jQuery, HTML 5 Audio

distributed-speaker-diarization-system's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.