vaksanca's Introduction

Vāksañcayaḥ, a Sanskrit speech corpus, has more than 78 hours of data and contains recordings of 45,953 sentences at a sampling rate of 22 kHz. The content is mainly readings of various texts spanning many Śāstras of Saṃskṛt literature, and also includes contemporary stories, radio programs, extempore discourses, etc. The summary datasheet associated with this corpus can be accessed here - Link. Please download the corpus from https://www.cse.iitb.ac.in/~asr/.

Environments

  • python version: 3.7.3
  • Model files
    • The list of speakers used in the train, validation, test and out-of-domain test splits is given in the README file of the corpus.
    • SRILM LM link
  • Results for different models
    • In-domain test data WER: 21.94 for the best performing model (SLP1 as the script and BPE splits as the LM unit).
    • Out-of-domain test data WER for different speakers is reported in the paper.

Recipe

This Kaldi recipe is based on subword units: Vowel Split and Byte Pair Encoding (BPE). For the word-based models, we used the Wall Street Journal recipe.
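
For readers unfamiliar with Byte Pair Encoding, the following is a minimal Python sketch of how BPE merge rules can be learned and applied. It is an illustration of the general technique only, not the code used in this recipe: the function names (`learn_bpe`, `apply_bpe`, `merge_pair`), the toy SLP1 words and the tie-breaking rule are all assumptions, and end-of-word markers are omitted for brevity.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from an iterable of words (hypothetical helper)."""
    # Each word starts as a tuple of single characters, weighted by its frequency.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Pick the most frequent pair; ties are broken by the pair itself (an assumption).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def apply_bpe(word, merges):
    """Segment `word` by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


if __name__ == "__main__":
    toy_words = ["rAma", "rAmaH", "rAmeRa"]  # toy SLP1 strings, not corpus data
    rules = learn_bpe(toy_words, 2)
    print(rules)
    print(apply_bpe("rAmaH", rules))
```

Joining the returned units always reproduces the input word, so the segmentation is lossless; the number of merges controls how coarse the subword units become.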

Training

Download the vowel splitter (this requires the text to be in SLP1 format)
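
To give a rough idea of what a vowel-based split on SLP1 text might look like, here is a small Python sketch. It assumes that a unit ends after each SLP1 vowel and that any trailing non-vowel characters attach to the last unit; the downloadable splitter may well handle anusvāra, visarga and other cases differently, so treat `vowel_split` and its rules as hypothetical.

```python
# SLP1 vowel letters: a A i I u U f F x X e E o O
SLP1_VOWELS = set("aAiIuUfFxXeEoO")


def vowel_split(word):
    """Split an SLP1 word into units that each end in a vowel (illustrative rule only)."""
    units = []
    current = ""
    for ch in word:
        current += ch
        if ch in SLP1_VOWELS:
            # Close the current unit as soon as a vowel is seen.
            units.append(current)
            current = ""
    if current:
        # Assumption: leftover characters after the last vowel join the final unit.
        if units:
            units[-1] += current
        else:
            units.append(current)
    return units


if __name__ == "__main__":
    for w in ["rAmaH", "gacCati"]:
        print(w, vowel_split(w))
```

Because SLP1 maps every Sanskrit sound to a single ASCII character, the split can be done character by character, which is presumably why the splitter expects SLP1 input.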

Download the pre-trained model

Download the processed dataset

Evaluate

From the pre-trained model (SLP1 vowel split)

./decode.sh test
# | WER : 18.12
./decode.sh truetest
# | WER : 34.88
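
The WER (word error rate) values above are percentages: the number of word substitutions, deletions and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. The Python sketch below shows that computation for a single sentence pair. It is not the scoring script used by `decode.sh`; the function name `wer` and the example sentences are made up for illustration.

```python
def wer(reference, hypothesis):
    """Word error rate (in percent) between two whitespace-tokenised strings."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of three reference words.
    print(round(wer("rAmaH vanam gacCati", "rAmaH vanaM gacCati"), 2))
```

A corpus-level WER is normally obtained by summing the edit counts and the reference lengths over all utterances before dividing, rather than averaging per-sentence values.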

Publications

Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, and Pawan Goyal. Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights. In ACL 2021.

vaksanca's People

Contributors

cyfer0618, pdadiga

vaksanca's Issues

Update processed dataset

Currently, the in-domain and out-of-domain test sets contain only sample files (provided with the first submission of the paper). Could you update both ASAP: all the configurations we experimented with (SLP1, Devanagari, word-based, VS, BPE) for the in-domain test set, and SLP1 with BPE for the out-of-domain test set? @cyfer0618

How To Use

Hi, I am a high school student and very much interested in these things. Could you please tell me how to use it on custom audio inputs?
