Comments (7)
We are still on HSK 1, so it is probably a bit early to start working on Chinese. We need more understanding of the language, unlike with the Romance languages, which are all fairly similar.
But feedback on how a Chinese model would ideally accept its input would be highly appreciated, i.e. how would you input words? Can you ask your fellow Chinese native speakers how they would do it?
- Via pinyin without tones, e.g. wo de mao tai da le;
- Via pinyin with some tones, e.g. wo3 de0 mao1 tai4 da4 le0;
- Via simplified 汉字;
I would bet on option 3. But as far as I have learned, the Chinese language has the following issues:
- The same 汉字 has multiple readings depending on context;
- Many 汉字 have similar readings;
This is somewhat alleviated by the fact that probably 90% of cases can be handled with just a dictionary of bi- and tri-grams of 汉字, but some cases would require understanding the broader context. That in turn would require googling in Chinese, using a Chinese VPN and Chinese websites, which would take at least HSK 3 or 4 knowledge. A chicken-and-egg problem, I guess.
We are solving / have partially solved the same problem in Russian. But if the community lends a hand - this of course can be done.
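To make the bi-/tri-gram dictionary idea above concrete, here is a minimal sketch of greedy longest-match lookup for characters with several readings. Everything in it is an assumption for illustration: the names `NGRAM_READINGS`, `DEFAULT_READINGS` and `to_pinyin` are hypothetical, the two tiny dictionaries are toy data rather than a real lexicon, and neutral tone is written as 0 to match the example above.

```python
# Toy data for illustration only; a real system would need a large lexicon.
# Multi-character entries override the per-character default reading.
NGRAM_READINGS = {
    "银行": ["yin2", "hang2"],   # 行 read as hang2 in "bank"
    "了解": ["liao3", "jie3"],   # 了 read as liao3 in "understand"
}

# Fallback reading for a single character seen outside any known n-gram.
DEFAULT_READINGS = {
    "我": "wo3", "的": "de0", "猫": "mao1", "太": "tai4", "大": "da4",
    "了": "le0", "去": "qu4", "银": "yin2", "行": "xing2", "解": "jie3",
}


def to_pinyin(text, max_n=3):
    """Convert 汉字 to tone-numbered pinyin, trying tri-grams, then bi-grams,
    then single-character defaults (greedy longest match, left to right)."""
    out = []
    i = 0
    while i < len(text):
        # Try the longest n-gram first, down to bi-grams.
        for n in range(min(max_n, len(text) - i), 1, -1):
            chunk = text[i:i + n]
            if chunk in NGRAM_READINGS:
                out.extend(NGRAM_READINGS[chunk])
                i += n
                break
        else:
            # No n-gram matched: use the default reading, or keep the
            # character unchanged if it is unknown.
            out.append(DEFAULT_READINGS.get(text[i], text[i]))
            i += 1
    return out


if __name__ == "__main__":
    print(" ".join(to_pinyin("我的猫太大了")))
    print(" ".join(to_pinyin("我去银行")))
```

This only covers the cases a dictionary can resolve; readings that depend on broader context would still need something smarter than a lookup table.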
from silero-models.
As for datasets, I think AISHELL-3 is one of the most widely used ones, and it comes with an Apache 2.0 license. PaddlePaddle is made by Baidu; as a big tech company, their choices may be inspiring. You may want to check the dataset references in https://github.com/PaddlePaddle/PaddleSpeech#model-list
Thank you. Chinese really is difficult; at the time I just wanted to express my hope for this feature. We generally input simplified Chinese characters. Pinyin is only an aid to pronunciation. In general, TTS use cases take text (simplified Chinese characters) as input and produce speech. Finally, I have been using your VAD all along, it's very useful, thanks! I'm not very familiar with GitHub, should I close this issue?
A quick search gave me this library
But I would need some input from the community to choose a proper tool. And ideally of course, we would need to build our own if this works.
Also, we found some high-quality TTS data in Chinese, but native speakers may be able to point out some Chinese-only resources.
Personally I'd go with option 2, which seems much more promising than the others. The repo https://github.com/mozillazg/python-pinyin/ you found seems very useful to me. When we learn to read and speak at school, that's basically what we do. Thank you again for working on this language. It's very nice of you. And I like Silero ❤️
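For what option 2 could look like on the input side, here is a rough sketch that parses tone-numbered pinyin into (syllable, tone) pairs and can also fall back to the toneless form of option 1. It uses only the standard library; `parse_pinyin` and `strip_tones` are hypothetical helper names, and the assumption that tones are written as a trailing digit 0-5 (with 0 or 5 for neutral tone) comes from the example in this thread, not from any Silero API.

```python
import re

# One syllable: lowercase letters (plus ü), optionally followed by a tone digit.
_SYLLABLE = re.compile(r"([a-zü]+)([0-5]?)")


def parse_pinyin(line):
    """Split a space-separated pinyin string into (syllable, tone) pairs.
    The tone is an int, or None when the syllable carries no tone digit."""
    pairs = []
    for token in line.lower().split():
        match = _SYLLABLE.fullmatch(token)
        if match is None:
            raise ValueError(f"not a pinyin syllable: {token!r}")
        syllable, tone = match.groups()
        pairs.append((syllable, int(tone) if tone else None))
    return pairs


def strip_tones(line):
    """Reduce tone-numbered pinyin to the toneless form (option 1)."""
    return " ".join(syllable for syllable, _ in parse_pinyin(line))


if __name__ == "__main__":
    print(parse_pinyin("wo3 de0 mao1 tai4 da4 le0"))
    print(strip_tones("wo3 de0 mao1 tai4 da4 le0"))
```

Accepting an optional tone digit means the same parser handles "pinyin with some tones", where only part of the input is annotated.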
Let's keep the issue open as a reminder. When we have time, we will do something.