Comments (5)
Hi, thank you for the pretrained model.
Which config.json should I use for training: _agap, _bgap, or _dap?
I got this error:
line 187, in load_data
'emotion': d[3],
Thanks
from radtts.
Nice catch! This happens because checkpoint_path expects a checkpoint dictionary that includes an optimizer and an iteration number, which the released models don't provide.
Please use warmstart_checkpoint_path instead of checkpoint_path and let us know how it goes:
python train.py -c config.json -p train_config.ignore_layers_warmstart=["speaker_embedding.weight"] train_config.warmstart_checkpoint_path=model_path.pt
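To illustrate the distinction between resuming and warm starting, here is a minimal sketch in plain Python. It is not the RADTTS code: the function names and the dictionary keys ("state_dict", "optimizer", "iteration") are hypothetical, chosen only to mirror the description in this comment. Resuming needs the optimizer and iteration number, while warm starting only copies weights and skips the layers listed as ignored.

```python
# Hypothetical sketch; names and keys are illustrative, not RADTTS's actual API.

def is_resumable_checkpoint(ckpt):
    """Return True if the dict has everything a full resume would need."""
    required = ("state_dict", "optimizer", "iteration")  # assumed key names
    return all(key in ckpt for key in required)


def warmstart_weights(pretrained, model_weights, ignore_layers=()):
    """Copy pretrained weights into a fresh model's weights.

    Layers named in ignore_layers keep their freshly initialized values,
    as do layers that the pretrained weights do not contain.
    """
    merged = dict(model_weights)
    for name, value in pretrained.items():
        if name in ignore_layers:
            continue  # e.g. a speaker embedding sized for a different speaker set
        if name in merged:
            merged[name] = value
    return merged


# A weights-only file cannot be resumed, but it can be used for a warm start.
released = {"state_dict": {"speaker_embedding.weight": "old", "decoder.weight": "pre"}}
fresh = {"speaker_embedding.weight": "new", "decoder.weight": "init"}

can_resume = is_resumable_checkpoint(released)
merged = warmstart_weights(
    released["state_dict"], fresh, ignore_layers=["speaker_embedding.weight"]
)
```

In this sketch the ignored speaker embedding stays at its new initialization while every other matching layer takes the pretrained value.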
from radtts.
Can you please pull and try again? The previous dataloader expected emotion and durations in the filelist. The current one works if the filelist has only filename, text, and speaker label.
Are you planning to train on new data?
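For reference, a loader that tolerates a three-column filelist might look roughly like the sketch below. This is not the repository's load_data: the "|" delimiter, the field names, and the function name are assumptions made for illustration. The point is that optional columns such as emotion are read only when present, so a row with just filename, text, and speaker label no longer fails on d[3].

```python
# Hypothetical sketch; delimiter and field names are assumptions.

def parse_filelist_line(line, delimiter="|"):
    """Parse one filelist row into a dict, treating emotion as optional."""
    fields = line.rstrip("\n").split(delimiter)
    if len(fields) < 3:
        raise ValueError(
            f"expected at least filename, text and speaker, got {len(fields)} field(s)"
        )
    entry = {
        "audiopath": fields[0],
        "text": fields[1],
        "speaker": fields[2],
    }
    # Only index the fourth column when the row actually has one.
    entry["emotion"] = fields[3] if len(fields) > 3 else None
    return entry


three_col = parse_filelist_line("wavs/a.wav|Hello there.|spk1\n")
four_col = parse_filelist_line("wavs/b.wav|Hi.|spk2|happy")
```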
from radtts.
I too would be interested in seeing a sample config for warm starting. I imagine there may be differences in the learning rate schedule?
from radtts.
I've had more success training from scratch, and with warm starting when not ignoring the speaker embedding. Warm starting a multispeaker model does not work for me.
from radtts.