Comments (3)
TalkNet models tend to have issues with drawn-out vowels. The only solution is more/cleaner training data.
I was getting vowel conversion errors.
Could you clarify? If you want it to pronounce something a certain way, you can type it in ARPABET between curly braces.
from controllabletalknet.
Would using samples with longer vowels in the dataset help the issues with drawn-out vowels?
"I was also wondering if there's anyway to make or edit the phoneme converter because I was getting vowel conversion errors."
What was I was trying to ask is, Is there any possible way to input the dataset transcription list in ARPABET or edit the ARPABET converter when training voices, because some proper nouns and non english words confuse it.
from controllabletalknet.
The current training notebook doesn't support ARPABET. Longer vowels in the training data should help, as should having more data in general. 15 minutes is the the bare minimum for decent results, and the best pony models have 2+ hours.
from controllabletalknet.
Related Issues (20)
- other gans? HOT 1
- docker version does not work... HOT 1
- is it possible to make a derpy and doctor whooves model please ?
- TalkNet_Training_Offline error HOT 1
- Error: UserWarning: torchaudio C++ extension is not available.
- talknet has issues downloading some models if your new to talknet HOT 2
- is it possible to introduce ditzy doo talking and singing model possibly that would be great for the next update thanks
- Please explain how to use custom models? HOT 1
- "Reduce metallic noise" fails with "Reconstruction VQGAN failed to download" - Where to place VQGAN file? HOT 1
- docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]]. HOT 2
- "is a directory" error
- Problem with setup.bat HOT 1
- Notebooks not working due to missing modules HOT 1
- ImportError: cannot import name 'get_num_classes' from 'torchmetrics.utilities.data' on step 3 HOT 1
- Exits with NeMo and numpy related errors on fresh arch install
- New Models
- Problem with linux version HOT 1
- Debugging issues in `backward_extractor`
- Where do `train.py` and `config_v1b.json` come from? HOT 1
- Regardless of mew tech add new models
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from controllabletalknet.