Comments (7)
I also have a similar issue but with 2x 8GB AMD Radeon 5500 XTs in a local Docker container.
from rvc_cli.
I tested it, and this problem extends to Applio, and doesn't happen on the RVC mainline. it seems to be a problem with containers
from rvc_cli.
I don't think it's a container issue. As you mentioned, I ran mainline RVC inside Docker, and multi-GPU training worked there. I believe the problem of parallel training not functioning has been around in Applio for a while. Not everyone has two or more GPUs for machine learning purposes. In fact, @blaisewf is already aware of this but doesn't have additional cards to test other than going to the cloud.
from rvc_cli.
I get it @TheTrustedComputer , I think I was wrong then, and Indeed, not everyone has multiple GPUs, I don't have it myself haha And I think it's impossible to discover the problem just through kaggle
from rvc_cli.
I don't think it's an applio bug, I tried to train with rvc-mainline with a very long dataset, which only allowed me to have a batch size of 7, when I wanted to test the two gpu's of the kaggle, then I got the error of insufficient vram.
-- Bad Traductor
from rvc_cli.
Using one GPU with 8 GB of VRAM, I can train with a batch size of 8 in mainline regardless of the dataset size. However, two GPUs require reducing it to 4, or I'll get out-of-memory errors. Whereas in Applio/RVC_CLI, training doesn't even start with 2+ GPUs.
from rvc_cli.
Fixed
from rvc_cli.
Related Issues (20)
- API - HOT 4
- [BUG] Batch Conversion on Apple Silicon Mac HOT 8
- IP and PORT change for API HOT 3
- [BUG] API infer not work. HOT 3
- [BUG] API won't start HOT 2
- [BUG] Documentation should say env/python.exe for Windows HOT 3
- Stuff to improve GUI wise or ya get what i mean :3 HOT 2
- Training problem HOT 1
- [BUG] Training Threshold set to incorrect value when no value is set in command HOT 2
- [BUG] '<' not supported between instances of 'str' and 'float' HOT 2
- [BUG] '<' not supported between instances of 'str' and 'float' / No output file HOT 10
- Error: 'config' when trying to infer from a model that I trained HOT 1
- Issue with Model Output Generating Noise HOT 5
- [BUG] RVC doesn't produce a usable result with default settings HOT 3
- Docker HOT 1
- [BUG] HOT 1
- [BUG]
- file and folder name are same, "rvc". HOT 1
- Infer in kaggle got error message HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rvc_cli.