Git Product home page Git Product logo

Comments (2)

v-nhandt21 avatar v-nhandt21 commented on August 29, 2024

And also which is the best sample rate for input audio? :))

from nisqa.

gabrielmittag avatar gabrielmittag commented on August 29, 2024

Hi, the model is trained to predict the quality of speech that was transmitted via telecommunication systems. It wasn't trained to predict the quality of enhanced speech, so the correlations might not be that high if you apply them to your samples. Regarding the other question I can give following recommendations. Let me know if you have more questions.

  • total duration of audio file: I recommend 6-12 seconds but it should also work for longer or shorter files.

  • proportion of speech and non-speech in the file: I recommend at least 50 %. The model should be able to handle 100% speech.

  • level requirements: The model is trained with an active speech level of -26 dB (according to ITU-T P.56) as default. Speech samples with different level will be judged as loudness degradation and result in a lower MOS score. The following figure shows the overall predicted MOS for different speech levels:
    image

  • suggested SNR for evaluation files (before speech enhancement is applied): I cannot comment on this as the model is not trained for speech enhancement algorithms.

  • best sample rate: The model is trained with 48 kHz but it is able to handle any sample rate. However, the missing frequencies for lower sample rates will be judged as quality degradation by the model and will results in lower MOS scores. Following figure shows the overall MOS for different cut-off frequencies of a low pass filter:
    image

(Figures taken from Deep Learning Based Speech Quality Prediction)

from nisqa.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.