Comments (4)
I've been generating clips of about 11 s at 44.1 kHz, and GPU memory usage spikes to around 27 GB. I had to set the batch size and other settings to low values, which makes training very slow.
from audio-diffusion.
(Disclaimer: I'm not an expert; I've just been experimenting.)
I did not change the length of the source audio, but I noticed that increasing the x-axis resolution of the mel spectrograms increased the output length. I used powers of 2 to be safe.
Side note: a Google Colab Pro subscription gives you access to GPUs with up to 40 GB of memory.
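In case it helps, here is a rough sketch of why a wider spectrogram gives a longer clip: each column of the mel spectrogram covers one hop of audio samples, so the duration is roughly width times hop length divided by sample rate. The function name `clip_seconds` and the hop length of 512 are my own assumptions for illustration, not values confirmed for this setup, so check them against your own config.

```python
def clip_seconds(x_res: int, hop_length: int, sample_rate: int) -> float:
    """Approximate duration (in seconds) covered by a mel spectrogram
    that is x_res frames wide, assuming one frame per hop."""
    return x_res * hop_length / sample_rate


# Assumed hop length of 512 samples at 44.1 kHz (hypothetical settings).
for x_res in (256, 512, 1024):
    seconds = clip_seconds(x_res, hop_length=512, sample_rate=44100)
    print(f"x_res={x_res}: about {seconds:.2f} s")
```

Under those assumed settings, doubling the x resolution doubles the clip length, which would also explain why memory use climbs so quickly with longer clips.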
from audio-diffusion.
Oof, that is a lot. When you create the mel spectrograms, do you have to manually chop your dataset into 11-second clips, or does the script do that for you? Also, what resolution is the mel spectrogram for your 11-second clip? Thanks so much for the quick response.
from audio-diffusion.
Okay, yeah, that's good to know. Thanks!
from audio-diffusion.