aliang-voice

repos: 55 gists: 0

Type: Organization

aliang-voice's Projects

adaspeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

adaspeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

attention-is-all-you-need-in-speech-separation

Speech Separation

audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

audioeditingcode

audioldm2

Text-to-Audio/Music Generation

audioset_tagging_cnn

AI song recognition from audio

awesome-normalizing-flows

Awesome resources on normalizing flows.

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

bertpunc

SOTA punctuation restoration (e.g., for automatic speech recognition) deep learning model based on a pre-trained BERT model

chattts

ChatTTS is a generative speech model for daily dialogue.

commu-code

[NeurIPS'22] Official code of "ComMU: Dataset for Combinatorial Music Generation"

cross-speaker-emotion-transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech