Hi @roodrallec,
Glad to hear you find this project useful.
A) Currently, we don't use the G2P model in any way. I found it more useful to rely on the encoder to learn phonetic transcriptions, because it reflects the distribution of words that are actually used in conversation. Training on CMU (or other lexicons) just adds noise, because the entries in these lexicons don't reflect the actual occurrences of words. Also, there are a lot of exceptions where the pronunciation of a word (in terms of phonetics and accent) depends on context.
B) Training the vocoder takes about a month (2 weeks for Wavenet and another 2 weeks for ParallelWavenet)
C) Training the encoder took me three weeks.
I suggest you just start by training the encoder and use the pretrained models for vocoding. They are multi-speaker, and it is likely that you won't have to train anything else. Just one important thing: the models run at 24 kHz, so your data must be sampled at at least 24 kHz. TTS-Cube will downsample it automatically if it needs to.
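Not part of TTS-Cube itself, but here is a minimal stdlib sketch for verifying that a wav dataset meets the 24 kHz requirement before you start training (the function name and directory layout are just assumptions for illustration):

```python
import wave
from pathlib import Path

TARGET_SR = 24000  # the pre-trained TTS-Cube models run at 24 kHz


def find_low_rate_wavs(data_dir):
    """Return (filename, rate) pairs for wav files sampled below TARGET_SR.

    Files at or above 24 kHz are fine as-is, since TTS-Cube downsamples
    them automatically. Files below 24 kHz should be replaced: upsampling
    cannot recover the missing high-frequency content.
    """
    too_low = []
    for path in sorted(Path(data_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as wf:
            rate = wf.getframerate()
        if rate < TARGET_SR:
            too_low.append((path.name, rate))
    return too_low
```

Running this once over your corpus is much cheaper than discovering a sample-rate mismatch weeks into encoder training.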
You can try, every now and then, to synthesize new samples using your encoder and the pre-trained vocoder. Just stop when you feel that the results are good enough. One observation: the encoder model will stay for a long time in a state where it generates muffled sounds. After a while, the results will improve a lot.
Yes, it is fast enough for real-time TTS, provided that you have a GPU. I got good results (faster-than-real-time synthesis) on a GTX 1060.
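If you want to check this on your own hardware, the usual metric is the real-time factor (RTF). A small sketch, where `synthesize` stands in for whichever synthesis entry point you use (it is a hypothetical callable, not a TTS-Cube API):

```python
import time


def real_time_factor(synthesize, text):
    """Measure the real-time factor (RTF) of a synthesis function.

    `synthesize` is any callable returning (samples, sample_rate).
    RTF = wall-clock synthesis time / duration of the generated audio,
    so RTF < 1.0 means synthesis is faster than real time.
    """
    start = time.perf_counter()
    samples, sample_rate = synthesize(text)
    elapsed = time.perf_counter() - start
    audio_seconds = len(samples) / sample_rate
    return elapsed / audio_seconds
```

On a GTX 1060 you would expect an RTF below 1.0 for the full encoder + vocoder pipeline; a CPU-only setup will typically be well above it.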
from tts-cube.