
ctc_segmentation's People

Contributors

cornerfarmer, lumaku

ctc_segmentation's Issues

Why did you do it in this manner?

Hello, I am very grateful for your work; the results are really good. After reading the paper four or five times, I am still confused, especially by equations (1) and (2). The method relies on the undecoded path graph obtained from the encoder model. So how does an encoder-decoder speech recognition model with CTC and attention help the segmentation precision?

Evaluations with OOD data?

This is really interesting; the results look much better than gentle (which is already a very nice tool).
I am curious: have you also evaluated it in a 'completely unlabelled' context?

Reading the paper, my understanding is that the unlabelled section is limited to data where every target utterance still has some central kernel of data that does contain a reliable transcription. These recordings are then prepended/appended with additional audio/speech data.

Have you looked, or are you looking, at using this as a means to extend a training corpus with, for instance, ASR hypothesis lattices produced for novel input?

I'm thinking something like a still slightly more structured segue into unsupervised or semi-supervised training like this:

License

Very impressive work!

The repo doesn't include a license file, although the files you added to the espnet repo mention the Apache 2.0 license. Would it be possible to add a license to this project?

Thanks.

questions

Dear author, I want to use evaluate_segments.py to evaluate my output text grids and their quality. Is it suitable for this?

Thank you very much for your reply.

where is the exp/tedlium2_rnn/cmvn.ark from?

Hi, I have run the program according to the README. Now I want to use another model from the ESPnet Model Zoo, but "cmvn.ark" is not included in the zip file. How can I get cmvn.ark, and what is it used for? Thanks.
