Light

chavinlo / riffusion-manipulation Goto Github PK

View Code? Open in Web Editor NEW

79.0 3.0 12.0 23.3 MB

tools to manipulate audio with riffusion

Python 100.00%

ai diffusers diffusion generative-music music stable-diffusion riffusion

riffusion-manipulation's Introduction

📻 What I am listening to: https://www.last.fm/user/hololens

What I worked on previously:

🎞️ TempoFunk, a Text-To-Video Model: https://huggingface.co/TempoFunk/makeavid-sd-jax
🏃💨 SDA, a simplified Stable Diffusion TensorRT pipeline: https://github.com/chavinlo/sda-node
A few more projects (decentralized training, riffusion control, etc...)

HuggingFace: https://huggingface.co/chavinlo

Discord: @qasb

Bye!

riffusion-manipulation's People

Contributors

Stargazers

Watchers

Forkers

nopeanuts aavetis inu-ai jhurliman jprobichaud kandy22 eric-hacker pupubear007 hdparmar xzuyn compstudent

riffusion-manipulation's Issues

--medvram? --opt-split-attention?

is there a chance of using a --medvram or --opt-split-attention? I was trying to use the img2audio.py script to convert a long png to some audio but my card can't allocate enough Vram.

pycharm terminal

not working via pycharm terminal

RuntimeError: Numpy is not available

I have this error when doing img2audio Python 3.10.6. please help

C:\Users\TheGriot\Desktop\application\Converter.venv\lib\site-packages\torchaudio\compliance\kaldi.py:22: UserWarning: Failed to initialize NumPy: module compiled against API version 0x10 but this version of numpy is 0xe (Triggered internally at ..\torch\csrc\utils\tensor_numpy.cpp:77.)
EPSILON = torch.tensor(torch.finfo(torch.float).eps)
C:\Users\TheGriot\Desktop\application\Converter.venv\lib\site-packages\torchaudio\backend\utils.py:62: UserWarning: No audio backend is available.
warnings.warn("No audio backend is available.")
Traceback (most recent call last):
File "C:\Users\TheGriot\Desktop\application\Converter\riffusion-manipulation\img2audio.py", line 155, in
wav_bytes, duration_s = wav_bytes_from_spectrogram_image(image, duration=int(args.duration), nmels=int(args.nmels), maxvol=int(args.maxvol), power_for_image=float(args.powerforimage))
File "C:\Users\TheGriot\Desktop\application\Converter\riffusion-manipulation\img2audio.py", line 112, in wav_bytes_from_spectrogram_image
samples = waveform_from_spectrogram(
File "C:\Users\TheGriot\Desktop\application\Converter\riffusion-manipulation\img2audio.py", line 56, in waveform_from_spectrogram
Sxx_torch = torch.from_numpy(Sxx).to(device)
RuntimeError: Numpy is not available

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

chavinlo / riffusion-manipulation Goto Github PK

riffusion-manipulation's Introduction

riffusion-manipulation's People

Contributors

Stargazers

Watchers

Forkers

riffusion-manipulation's Issues

--medvram? --opt-split-attention?

pycharm terminal

RuntimeError: Numpy is not available

Feature Request : img2audio batch convert.

Upscale

[feature] Split longer clips to multiple spectograms?

thoughts on sd xl?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent