sean's Projects
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Swift audio synthesis, processing, & analysis platform for iOS, macOS and tvOS
AudioLDM training, finetuning, evaluation and inference.
Text-to-Audio/Music Generation
Audioldm2 vae part
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
π Text-Prompted Generative Audio Model
Barkify: an unoffical training implementation of Bark TTS by suno-ai
A Tensorflow implementation of CapsNet(Capsules Net) in Hinton's paper Dynamic Routing Between Capsules
to explore relationship chord and scale pitch
A Chinese version of A Neural Parametric Singing Synthesizer
This app recognises 3 hand signs - fist, high five and victory hand [ rock, paper, scissors basically :) ] with live feed camera. It uses a HandSigns.mlmodel which has been trained using Custom Vision from Microsoft.
The IK Analysis plugin integrates Lucene IK analyzer into elasticsearch, support customized dictionary.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN
Google VR SDK for Unity
Hand Detection using a Custom Vision Model on iOS
An Application that can control a timer with just a Look at your hand. Not Kidding...Seriously.
Almost Real-time Object Detection using Apple's CoreML and YOLO v1 -
pytorch implementation of Lead Sheet Generation and Arrangement
Magenta: Music and Art Generation with Machine Intelligence
MakeNoise_ε€ͺη₯
Models and examples built with TensorFlow
A virtual reality interaction system for unity based on physics.
γδΈζθͺηΆθ―θ¨η解ε¨ιθι’εηεΊη¨γδ½δΈδ»εΊ