Comments (4)
Hi there! Thank you for the feedback! First of all, you have CUDA, but has it worked at all with other ML libraries or projects? I've had a hell of a time getting it to work with PyTorch, but I will update the README and this issue when I have a more robust process for getting it working on Windows.
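For anyone wanting to check the CUDA side on their own machine, here is a minimal sketch of a check. It assumes PyTorch is the library in question and falls back gracefully when it is not installed; `cuda_report` is a made-up helper name for illustration, not something that exists in weeablind.

```python
import importlib.util


def cuda_report():
    """Return a small dict describing whether PyTorch can see a CUDA GPU."""
    report = {
        "torch_installed": False,
        "torch_cuda_build": None,
        "cuda_available": False,
        "device_name": None,
    }
    # Skip the import entirely when PyTorch is not installed.
    if importlib.util.find_spec("torch") is None:
        return report

    import torch

    report["torch_installed"] = True
    # torch.version.cuda is None on CPU-only builds of PyTorch.
    report["torch_cuda_build"] = torch.version.cuda
    report["cuda_available"] = bool(torch.cuda.is_available())
    if report["cuda_available"]:
        report["device_name"] = torch.cuda.get_device_name(0)
    return report


if __name__ == "__main__":
    for key, value in cuda_report().items():
        print(f"{key}: {value}")
```

If `torch_cuda_build` comes back as `None` while the GPU works in other projects, the likely culprit is a CPU-only PyTorch wheel in this environment rather than the driver, though that is only a guess from the outside.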
Next, some videos have subtitles, but sometimes they're in a bitmap-based format instead of a text-based one. I need to add a feature that can use OCR to turn these subs into text sometime, or put an actual error message in for the meantime.
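As a rough illustration of the bitmap versus text distinction, the sketch below asks `ffprobe` for the subtitle streams in a file and sorts them by codec name. It assumes `ffprobe` is on the PATH; the helper names and the two codec sets are my own and are not exhaustive, and this is not how weeablind itself reads subtitles.

```python
import json
import subprocess

# Codec names as reported by ffprobe; neither set is exhaustive.
BITMAP_SUB_CODECS = {"hdmv_pgs_subtitle", "dvd_subtitle", "dvb_subtitle", "xsub"}
TEXT_SUB_CODECS = {"subrip", "ass", "ssa", "mov_text", "webvtt", "text"}


def classify_subtitle_codec(codec_name):
    """Label a subtitle codec as 'bitmap', 'text', or 'unknown'."""
    if codec_name in BITMAP_SUB_CODECS:
        return "bitmap"
    if codec_name in TEXT_SUB_CODECS:
        return "text"
    return "unknown"


def list_subtitle_streams(video_path):
    """Return (stream index, codec name, kind) for each subtitle stream."""
    cmd = [
        "ffprobe", "-v", "error",
        "-select_streams", "s",
        "-show_entries", "stream=index,codec_name",
        "-of", "json",
        video_path,
    ]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    streams = json.loads(result.stdout).get("streams", [])
    return [
        (s["index"], s.get("codec_name"), classify_subtitle_codec(s.get("codec_name")))
        for s in streams
    ]
```

Streams labelled `bitmap` are the ones that would need OCR before they can be used as text.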
Third, which Coqui model are you using? Since the switch to the @idiap/coqui-ai-TTS fork, I've noticed that multispeaker models that support a speaker wav have behaved slightly differently. I will look into making this easier to use.
And finally, setting up OCR requires the video_ocr library, which requires py-tesserocr, which in turn takes a lot of fiddling and research to set up on Windows. I had to install a wheel file and set a path variable, but I haven't added any instructions yet, because I was thinking of either rewriting the library or forking it and switching it to a more modern, compatible tesseract library.
from weeablind.
You have CUDA, but has it worked at all with other ML libraries/projects?
Hi, yes, the GPU has worked for other projects; this is the only one where it hasn't.
which Coqui model are you using
The default one, called tts_models/en/vctk/vits. Which one should I download? I feel like the list is unnecessarily bloated :c
I've never seen a models option on any Coqui repository before... the one called 2.0.2 has always worked pretty well, but I don't see it in the options.
from weeablind.
I just tried installing it in WSL, hoping that might fix the issues, but running
pip install -r requirements-linux310.txt
gives me this error at the end:
× Encountered error while trying to install package.
╰─> wxPython
from weeablind.
Hi, I'm not sure about that particular Coqui model. The list just shows every single one Coqui reports as available, grouped by language, and many of them are pretty lackluster. But I'm not sure why having vctk is unselecting the voice. I changed something related to that while updating for xtts, so I'll test it again more thoroughly.
And I've never tried with WSL. Does your WSL have a desktop environment that supports GTK applications? It might have something to do with that, because on Linux you need to have this installed to install wxPython:
-f https://extras.wxpython.org/wxPython4/extras/linux/gtk3/ubuntu-22.04
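To make that a bit more concrete, here is a small sketch under a couple of assumptions: that a GUI session shows up as a `DISPLAY` or `WAYLAND_DISPLAY` environment variable, and that the URL above is passed to pip as a find-links (`-f`) source for prebuilt wheels. Both function names are hypothetical. The display check only says whether a GTK window could open at all, not why pip failed.

```python
import os
import sys

# Wheel index taken from the comment above (Ubuntu 22.04, GTK3 builds).
WX_EXTRAS_URL = "https://extras.wxpython.org/wxPython4/extras/linux/gtk3/ubuntu-22.04"


def has_gui_display(env=None):
    """Return True if the environment advertises an X11 or Wayland display."""
    env = os.environ if env is None else env
    return bool(env.get("DISPLAY") or env.get("WAYLAND_DISPLAY"))


def wxpython_install_command(python=sys.executable):
    """Build (but do not run) a pip command that looks for wxPython wheels at the extras URL."""
    return [python, "-m", "pip", "install", "-f", WX_EXTRAS_URL, "wxPython"]


if __name__ == "__main__":
    print("GUI display detected:", has_gui_display())
    print("Install command:", " ".join(wxpython_install_command()))
```

The command is only printed here, so it can be reviewed before running it inside the same virtual environment the requirements file was installed into.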
And tonight I'll probably look into CUDA for Windows. I've run out of storage space pretty hard, so reinstalling large things has been a pain these days haha
Thanks for your patience and I'm sorry you're having issues. This is truly my first time managing a project of this scale :O
from weeablind.