Git Product home page Git Product logo

berom_speech_dataset's Introduction

Berom_Speech_Dataset

this repo is a work in progress and contains Berom Speech data for ML Speech Applications

Downloading

Go to your terminal and enter;

git clone https://github.com/mandeebot/Berom_Speech_Data.git

This adds a folder called "Berom_Speech_Data" which contains the files to your local directory.

Statistics

  • 212 recordings of an average of 20 word length per recording
  • total recording hours

Data Collection

Recording and text Data were collected from a single Berom Male speaker via WhatsApp, hopefully, this is a baseline for berom speech data and as the project grows, the Lig-Aikuma Android app will be used in crowd-sourcing for more Berom Data. It is an easy-to-use app with a good interface for recording and elicitation. It offers 6 modes of usage;

  • Recording
  • Respeaking
  • Translating
  • Elicitation
  • Check
  • Share

Data Preprocessing

Preprocessing involved;

  • validating data for errors and removing corrupt files

One main dataset directory with subdirectories;

  • wav contains the unprocessed recorded files and metadata

Application

The dataset can be used majorly for low-resource speech model experiments or for cross-lingual ASR.

Problems Encountered

-Berom is a low resource language, meaning there is a very very low amount of resources online to supplement this, actively working towards generating more Berom speech data to add to this repo

  • so far the text transcriptions collected D0 NOT have their tonal descriptions represented(diacritcs), this sets this dataset at some disadvantage, as it is very common in the Berom Language to have one word with different meanings, the different meanings of such a word is often indicated by the tone present in the word. Working towards updating this repo with data that have their tonal descriptions represented
  • Recording speech takes time and can become uninteresting to perform quickly.

Contributing

If you would like to contribute to this project by recording more audio files and transcriptions, You can make a pull request and I will be happy to add you to the project.

Original Author

Mandieng Bot

License

berom_speech_dataset's People

Contributors

mandeebot avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.