Jose Giraldo's Projects
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Thousands of bird sounds visualized using machine learning.
Data manipulation and transformation for audio signal processing, powered by PyTorch
The Hugging Face Course on Transformers for Audio
Collection of notebooks and scripts related to audio processing and machine learning.
Various experiments with the AudioSet dataset and TensorFlow
Translation of VIP cheatsheets for Machine Learning and Deep Learning
Music AI team projects
DCASE 2017 Baseline system
Experiments with the DCASE framework and database
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
Environmental sound recognition of traffic events such as car, motorcycle, heavy vehicle, and horn using librosa and scikit-learn
Sound Level Meter with ESP32 and I2S MEMS microphone
Functional programming language for signal processing and sound synthesis
Lab Materials for MIT 6.S191: Introduction to Deep Learning
An autoregressive character-level language model for making more things
ICS-43432 MEMS breakout circular board
Models and examples built with TensorFlow
Instructional notebooks on music information retrieval.
Extraction pipelines and experiments with audio embeddings (Jose's GSoC work, 2021)
p5.sound brings the Processing approach to Web Audio and p5.js.
PAM is a no-reference audio quality metric for audio generation tasks
Unofficial implementation of NVIDIA P-Flow TTS paper
psysound3 getting backroom surgery to work with MIRtoolbox
Implementation of a fractional octave filterbank for Python, based on NumPy and CFFI.