florianeagox / weeablind Goto Github PK
View Code? Open in Web Editor NEWA program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Home Page: https://tessapainter.com/project/WeeaBlind
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Home Page: https://tessapainter.com/project/WeeaBlind
Exception in thread Thread-17 (run_dubbing):
Traceback (most recent call last):
File "C:\Users\Tanuj\AppData\Local\Programs\Python\Python310\lib\threading.py", line 1009, in _bootstrap_inner
self.run()
File "C:\Users\Tanuj\AppData\Local\Programs\Python\Python310\lib\threading.py", line 946, in run
self._target(*self._args, **self._kwargs)
File "C:\freelancer\dub\weeablind\video.py", line 177, in run_dubbing
progress_hook(i+1, "Mixing New Audio")
UnboundLocalError: local variable 'i' referenced before assignment
Hi nice works,i had a issue with this step on Windows 10 with a python environment,please help me to fix this problem "cannot import name 'espeakng' from espeakng"
thanks in advance
pip install -r requirements-win-310.txt --no-deps
Processing c:\users\seth\downloads\tesserocr-2.6.0-cp310-cp310-win_amd64.whl (from -r requirements-win-310.txt (line 222))
ERROR: Could not install packages due to an OSError: [Errno 2] No such file or directory: 'C:\Users\seth\Downloads\tesserocr-2.6.0-cp310-cp310-win_amd64.whl'
i'm on windows with python 3.10
i have installed espeakng and i added it to the path
Traceback (most recent call last):
File "C:\Users\checc\Desktop\weeablind\weeablind.py", line 3, in
from tabs.ConfigureVoiceTab import ConfigureVoiceTab
File "C:\Users\checc\Desktop\weeablind\tabs\ConfigureVoiceTab.py", line 2, in
import app_state
File "C:\Users\checc\Desktop\weeablind\app_state.py", line 1, in
from Voice import Voice
File "C:\Users\checc\Desktop\weeablind\Voice.py", line 9, in
from espeakng import ESpeakNG
ImportError: cannot import name 'ESpeakNG' from 'espeakng' (C:\Users\checc\AppData\Local\Programs\Python\Python310\lib\site-packages\espeakng_init_.py)
(venv) PS C:\Users\checc\Desktop\weeablind>
When i run weeablind.py - I get this error:
torchvision is not available - cannot save figures
Then it shows that the GPU Is not detected, and that OCR is not supported (Everything else is supported)
Choosing a different coqui voice shows the voice name being selected for a second then the name slowly fades out, and leaves me with no voice selected
([!] Looks like you are using a multi-speaker model. You need to define either a speaker_idx
or a speaker_wav
to use a multi-speaker model.)
Clicking "Run Dubbing" Outputs this error:
UnboundLocalError: local variable 'i' referenced before assignment
Then it locks me into an inescapable Progress bar
Also importing a video gives three errors
Output file does not contain any stream
Error opening output file C:\Apps\weeablind\output\01 Setting Up an Optimized Environment for Drawing.srt.
Error opening output files: Invalid argument
{'status': 'subless'}
(Even though i imported a video with subtitles within it)
I have FFmpeg, MSVC Build Tools, and Cuda all installed... (I also did the setup and all)
(I also had problems with generating subtitles)
Just wanted to bring this new TTS library+model to your attention which allows voice characteristics to be steered via a separate prompt:
https://github.com/huggingface/parler-tts (impressive demos on the HF space linked there).
Afterthought: Wondering whether an LLM might be able to derive such prompts from a pure text transcript...
After following the instructions on windows, I try to run weeblind.py, but get the following error:
Traceback (most recent call last):
File "D:\Weeablind\weeablind.py", line 1, in
import wx
ModuleNotFoundError: No module named 'wx'
After installing wxpython, it gets a bit further, only to give this error:
Traceback (most recent call last):
File "D:\Weeablind\weeablind.py", line 3, in
from tabs.ConfigureVoiceTab import ConfigureVoiceTab
File "D:\Weeablind\tabs\ConfigureVoiceTab.py", line 2, in
import app_state
File "D:\Weeablind\app_state.py", line 1, in
from Voice import Voice
File "D:\Weeablind\Voice.py", line 5, in
import feature_support
File "D:\Weeablind\feature_support.py", line 10, in
diarization_supported = is_module_available("pyannote.audio")
File "D:\Weeablind\feature_support.py", line 6, in is_module_available
return importlib.util.find_spec(module_name) is not None
File "C:\Users\winuser\anaconda3\envs\wee\lib\importlib\util.py", line 94, in find_spec
parent = import(parent_name, fromlist=['path'])
ModuleNotFoundError: No module named 'pyannote'
I have a problem, when installing "pip install -r requirements.txt" it always stops and shows --- "This is taking longer than usual. You might need to provide the dependency resolver with stricter constraints to reduce runtime. See https ://pip.pypa.io/warnings/backtracking for guidance."-- I waited 4, 5 hours but it didn't work
On line 222 of requirements-win-310.txt, you'll see:
file:///C:/Users/seth/Downloads/tesserocr-2.6.0-cp310-cp310-win_amd64.whl#sha256=a31c6eaa6380fd7d4e7764597c4f5a16f6e8f4abf4cdc9d61c52e36ea4f8a850
This brings up an error and stops installation. I could remove it, but I have a feeling it's important. I don't have AMD though, so maybe it's fine? On second thought, I'll replace it with this and hope it works:
tesserocr==2.6.0
Hello esteemed developer. Firstly, I'd like to express my gratitude for creating and maintaining this project. Thanks to individuals like you, OpenSource thrives!
I followed the instructions in the readme, but unfortunately, I still encountered an error.
My ENV
Win 11 x64 - Python 3.10 (from Microsoft Store).
FFmpeg, Espeak-NG, and MSVC Build Tools are installed.
My GPU is an Nvidia RTX 4070 Ti.
Steps to reproduce the error:
Error in the console:
C:\Users\Danil\dev\weeablind\output\sample.wav
espeak [WinError 2] The system cannot find the file specified
espeakng [WinError 2] The system cannot find the file specified
torchvision is not available - cannot save figures
C:\Users\Danil\dev\weeablind\venv\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
torchaudio.set_audio_backend("soundfile")
2024-04-03 22:55:50.907235: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'cudart64_110.dll'; dlerror: cudart64_110.dll not found
2024-04-03 22:55:50.907388: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
Traceback (most recent call last):
File "C:\Users\Danil\dev\weeablind\weeablind.py", line 6, in <module>
from tabs.ListStreams import ListStreamsTab
File "C:\Users\Danil\dev\weeablind\tabs\ListStreams.py", line 3, in <module>
import vocal_isolation
File "C:\Users\Danil\dev\weeablind\vocal_isolation.py", line 4, in <module>
from spleeter.separator import Separator
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\spleeter\separator.py", line 26, in <module>
import tensorflow as tf # type: ignore
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\__init__.py", line 37, in <module>
from tensorflow.python.tools import module_util as _module_util
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\python\__init__.py", line 37, in <module>
from tensorflow.python.eager import context
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\python\eager\context.py", line 29, in <module>
from tensorflow.core.framework import function_pb2
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\core\framework\function_pb2.py", line 16, in <module>
from tensorflow.core.framework import attr_value_pb2 as tensorflow_dot_core_dot_framework_dot_attr__value__pb2
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\core\framework\attr_value_pb2.py", line 16, in <module>
from tensorflow.core.framework import tensor_pb2 as tensorflow_dot_core_dot_framework_dot_tensor__pb2
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\core\framework\tensor_pb2.py", line 16, in <module>
from tensorflow.core.framework import resource_handle_pb2 as tensorflow_dot_core_dot_framework_dot_resource__handle__pb2
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\core\framework\resource_handle_pb2.py", line 16, in <module>
from tensorflow.core.framework import tensor_shape_pb2 as tensorflow_dot_core_dot_framework_dot_tensor__shape__pb2
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\tensorflow\core\framework\tensor_shape_pb2.py", line 36, in <module>
_descriptor.FieldDescriptor(
File "C:\Users\Danil\dev\weeablind\venv\lib\site-packages\google\protobuf\descriptor.py", line 621, in __new__
_message.Message._CheckCalledFromGeneratedFile()
TypeError: Descriptors cannot be created directly.
If this call came from a _pb2.py file, your generated code is out of date and must be regenerated with protoc >= 3.19.0.
If you cannot immediately regenerate your protos, some other possible workarounds are:
1. Downgrade the protobuf package to 3.20.x or lower.
2. Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).
More information: https://developers.google.com/protocol-buffers/docs/news/2022-05-06#python-updates
Please let me know how I could avoid such an error? Perhaps I did something wrong.
hello, I get an error for espeak, yet all the dependencies are downloaded, the virtual environment is ok.
weeablind\tabs\ConfigureVoiceTab.py", line 1, in
import app_state
File "C:\Users\ravai\Desktop\dubbing\weeablind\app_state.py", line 5, in
speakers[0].set_voice_params('tts_models/en/vctk/vits', 'p326') # p340
File "C:\Users\ravai\Desktop\dubbing\weeablind\Voice.py", line 103, in set_voice_params
self.voice.load_tts_model_by_name(voice)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\api.py", line 185, in load_tts_model_by_name self.synthesizer = Synthesizer(
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\utils\synthesizer.py", line 93, in init
self.load_tts(tts_checkpoint, tts_config_path, use_cuda)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\utils\synthesizer.py", line 187, in load_tts
self.tts_model = setup_tts_model(config=self.tts_config)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\tts\models_init.py", line 13, in setup_model
model = MyModel.init_from_config(config=config, samples=samples)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\tts\models\vits.py", line 1796, in init_from_config
tokenizer, new_config = TTSTokenizer.init_from_config(config)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\tts\utils\text\tokenizer.py", line 198, in init_from_config
phonemizer = get_phonemizer_by_name(config.phonemizer, **phonemizer_kwargs)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\tts\utils\text\phonemizers_init.py", line 60, in get_phonemizer_by_name
return ESpeak(**kwargs)
File "C:\Users\ravai\Desktop\dubbing\weeablind\venv\lib\site-packages\TTS\tts\utils\text\phonemizers\espeak_wrapper.py", line 114, in init
raise Exception(" [!] No espeak backend found. Install espeak-ng or espeak to your system.")
Exception: [!] No espeak backend found. Install espeak-ng or espeak to your system.
The installation fails on Linux.
Version 18.4 is not available (any more?). Might have been revoked.
File "C:\Users\Reno\voice\weeablind\weeablind.py", line 3, in <module>
from Voice import Voice
File "C:\Users\Reno\voice\weeablind\Voice.py", line 6, in <module>
from espeakng import ESpeakNG
ImportError: cannot import name 'ESpeakNG' from 'espeakng' (C:\Users\Reno\voice\weeablind\venv\Lib\site-packages\espeakng\__init__.py)
Hey! congrats on releasing the app! I've just noticed a slight problem that could really throw off new users from using it
The exe file gets detected as malware by 7 different Anti-Virus engines
Do you have any ideas why this is happening? are you sure that the program you used to craft the exe file is even safe? I've never seen this many false-positives on a simple 8mb file before...
beautiful work! Can we have an usage section in the readme. Can't wait to play with it!
I love this project! Keep up the excellent work. I've wanted something like this for years, but I don't have the skills. I wish I had the skill to help you work on this. There are so many good animes out there that I have been unable to watch because they never got a dub or will never get one. You're doing God's work.
This software is not possible to use on a Mac. In Python version 3.8, an error appears stating that espeakng==1.0.3
requires Python version 3.9 or higher. Unfortunately, when I update to Python 3.9, I get an error saying vidia-cublas-cu12=12.1.3.1
needs Python version lower than 3.9.
ERROR: Ignored the following versions that require a different python version: 1.0.3 Requires-Python >=3.9; 1.2.0 Requires-Python >=3.9
ERROR: Could not find a version that satisfies the requirement espeakng==1.0.3 (from versions: 1.0.1, 1.0.2)
ERROR: No matching distribution found for espeakng==1.0.3
ERROR: Ignored the following versions that require a different python version: 0.52.0 Requires-Python >=3.6,<3.9; 0.52.0rc3 Requires-Python >=3.6,<3.9
ERROR: Could not find a version that satisfies the requirement nvidia-cublas-cu12==12.1.3.1 (from versions: 0.0.1.dev5)
ERROR: No matching distribution found for nvidia-cublas-cu12==12.1.3.1
Hey there,
So, I was trying to install the dependencies for the project, but ran into a bit of a snag. Here's the error message I got:
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
video-ocr 0.0.2 requires opencv-python~=4.5.5.62, which is not installed.
video-ocr 0.0.2 requires tesserocr~=2.5.2, which is not installed.
flask 3.0.2 requires click>=8.1.3, but you have click 7.1.2 which is incompatible.
video-ocr 0.0.2 requires click~=8.0.1, but you have click 7.1.2 which is incompatible.
video-ocr 0.0.2 requires numpy~=1.22.2, but you have numpy 1.22.0 which is incompatible.
video-ocr 0.0.2 requires Pillow~=9.0.1, but you have pillow 10.2.0 which is incompatible.
video-ocr 0.0.2 requires scipy~=1.8.0, but you have scipy 1.11.3 which is incompatible.
video-ocr 0.0.2 requires tqdm~=4.62.3, but you have tqdm 4.66.2 which is incompatible.
Seems like there are quite a few compatibility issues here. Any idea what's going on? Could we update the dependencies or is there something else we need to do to resolve this?
Cheers!
Hi @FlorianEagox, can you list the libraries that should be installed based on the operating system and running options? At https://stackoverflow.com/questions/77084311, you did not mention any library names or which one should be used for a particular OS.
Hi, are there any instructions on how to go about getting the Windows version working? I am unsure if I’m doing something wrong; it’s saying that FFmpeg isn’t detected. Also, my GPU was not detected.
I ran into a snag while using your project and wanted to reach out about it. Seems like there's an issue with the "julius" module not being found, even though I've installed all the dependencies correctly.
Here's the error I'm getting:
Traceback (most recent call last): File "C:\WINDOWS\system32\weeablind\weeablind.py", line 4, in <module> from tabs.SubtitlesTab import SubtitlesTab ... File "C:\Windows\System32\weeablind\venv\Lib\site-packages\torch_audiomentations\augmentations\band_pass_filter.py", line 1, in <module> import julius ModuleNotFoundError: No module named 'julius'
I double-checked everything in my virtual environment, and it all seems in order. Any ideas on how I can fix this? Any additional steps or configurations I might be missing?
Thanks a bunch for your help!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.