Git Product home page Git Product logo

Comments (4)

Bahm9919 avatar Bahm9919 commented on July 3, 2024 1

Check your text cleaners, maybe you use another arphabet or phoneme representation or you didnt include CMU dict.

Thank you for a reply.

Here's the frames I tried (200 to 500)

n200_sid0_sigma0.5.mov
n300_sid0_sigma0.5.mov
n400_sid0_sigma0.5.mov
n500_sid0_sigma0.5.mov
There's roughly 230 frames for the n400 and 290 frames for n500. And they are all still speaking alien language.

from flowtron.

Bahm9919 avatar Bahm9919 commented on July 3, 2024

It's normal, and It's maybe attention problem (or maybe gate problem), How many frames do you provide? Try to provide more or less frames in inference with this options python inference.py -c config.json -f models/flowtron_ljs.pt -w models/waveglow_256channels_universal_v5.pt -t "It is well know that deep generative models have a rich latent space!" -i 0 -n 400 - where 400 number of frames and its default number of frames, try to use 200 or 300. And let me know.

from flowtron.

SornrasakC avatar SornrasakC commented on July 3, 2024

Thank you for a reply.

Here's the frames I tried (200 to 500)

n200_sid0_sigma0.5.mov
n300_sid0_sigma0.5.mov
n400_sid0_sigma0.5.mov
n500_sid0_sigma0.5.mov

There's roughly 230 frames for the n400 and 290 frames for n500.
And they are all still speaking alien language.

from flowtron.

SornrasakC avatar SornrasakC commented on July 3, 2024

So, I tried installing a new fresh repo from this main branch without any of my commit, then fix a few lines in inference.py (remove .half(), change 'bottom' to 'lower' ), I can now generate the sound to exactly what the text input is.

Then I came back to my original repo and was suspecting that my symbols.py was the problem, since I removed all the arpabet
from them.

After adding all the arpabet back, it's now working just like the fresh one.

Thank you for your help @Bahm9919

from flowtron.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.