Comments (5)
Hi @saibharani ,
The loss will probably not decrease anymore, but the synthesis will get better and it will not skip words in the future. To get an idea, I've trained the encoder for 300 epochs for the Romanian model and I just reached 190 epochs for the English dataset (1 month+ of training). Just keep training the encoder on your dataset. (you have the --resume option)
How many hours of training data do you have?
from tts-cube.
sorry for the late reply I was on a travel yesterday.
my training data is about 5.5hrs and the pronunciation is good but the only problem is it skips some words and the audio is distorted when the words are skipped do you suggest to use the updated repo or can i continue with the previous one. and if ii update the repo do i have to train it again
from tts-cube.
For 5.5 hours of training data you still requite about 200 epochs for the encoder (if it's single speaker). The new code adds global style tokens and the older models will be no longer supported. So, I suggest you update the code and restart training (re-import might be necessary). I also suggest you switch to 16khz.
Let the encoder train for two-three weeks and check the results then. I've also added support for three vocoders: wavenet, clarinet and waveglow.
Let me know if there is anything else,.
Best,
Tibi
from tts-cube.
ok, thank you. I wiil retrain it with the new code. do you plan on releasing any waveglow model and which has good inference time among the 3 vocoders
from tts-cube.
Yes, I will release a waveglow model. If you check the notebook from colaboratory, it already downloads a partially trained model from a google drive url. I still have to train it for 2-3 weeks, but I will add a permanent link after that.
The best results seem to go for wavenet and waveglow. Clarinet is a little bit muffled
from tts-cube.
Related Issues (20)
- How to use the G2P model HOT 1
- What is BeeCoder? HOT 5
- Negative loss when training step2 HOT 19
- Pretrained text encoder for included IAF model HOT 2
- English model and hardware requirements HOT 26
- Training times HOT 1
- Is there any interest in providing a model trained in Brazilian Portuguese? HOT 2
- Integration with LPCNET HOT 1
- Demo on Colab, possible improvements? HOT 11
- what is the present inference for generating 10sec audio using vocoder? HOT 2
- how to synthesize from melspectogram directly without using encoder HOT 3
- what should the development set's content be in a speech dataset and g2p? HOT 5
- colab notebook missing command to enter the github folder
- How to train non-English?
- How can I synthesize my own text to speech?
- cum folosesc vocea in romana?
- melgan vocoder is fast, let's integrate it? HOT 1
- Install fails on ARCH during pip command HOT 1
- Fine-tuning/Speaker adaptation HOT 15
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tts-cube.