Git Product home page Git Product logo

phones-las's People

Contributors

akademi4eg avatar eugkar avatar ivanhahan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

phones-las's Issues

Cross-lingual experiments

Hi,

I am doing some ASR experiments using AFs and I found your work, which is fantastic work.

And I noticed that you mentioned this framework can extend to cross-lingual ASR, Did you already have some papers?

Meanwhile, I am trying to replicate your work, hope it goes well

Thanks in advance

moving to TF 2.x

With all the new things since TF 2.0 release, it may be reasonable to port code to newest version.
Another option is migration to PyTorch. :)

problem with get_ipa

In preprocess_all.py:

            text = get_ipa(text, language, remove_all_diacritics=args.remove_diacritics,
                split_all_diphthongs=args.split_diphthongs)

This raises a huge amount of errors in output, because _postprocess from ipa_utils.py doesn't have remove_all_diacritics param.

Guidance on installing `espeak-ng`

Hi sciforce! This repo looks really nice.

I'm having trouble running this code with TIMIT and, in particular, with configuring the espeak-ng dependency.

Could you add a line to the readme on installing espeak-ng? Does this dependency even support mac?

Need help in infer.py

First of all, I would like to thank you for sharing this great work.

I trained the model successfully based on the instructions in README for articulatory detection in TIMIT.

But, I faced some problems in infer.py.

  1. This is the type of error "operands could not be broadcast together with shapes (62,) (42,) " in this function "segs_phones_to_frame_binf". Because there is 62 attributes and 42 phonemes [39 (+UNK, SOS, EOS)]. The length of logits_binf is only 42.

  2. Also, could you provide us the Notebook for compute the accuracy of attribute detection at frame level.

  3. Finally, in the paper, there are 28 place and manner attributes. My question, which mapping file that you used to calculate the detection accuracy of these 28 only.

Thanks a lot.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.