lizezheng Goto Github PK

followers: 4.0 following: 3.0 repos: 115.0 gists: 0.0

Name: WhiteJunior2

Type: User

Location: Beijing China

WhiteJunior2's Projects

3d-convolutional-speaker-recognition

a-convolutional-recurrent-neural-network-for-real-time-speech-enhancement

A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch

adaptive-softmax-pytorch

Adaptive Softmax implementation for PyTorch

adaptivesoftmax

This is an implement of Adaptive Softmax with pytorch.

This is a dataset of speech, music and sound effects that can provide training data for AIGC, AI model training, intelligent audio tool development, and audio applications. The audio dataset is mainly used in speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, sound synthesis, etc

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

anygpt

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

articulated-animation

Code for Motion Representations for Articulated Animation paper

asteroid

The PyTorch-based audio source separation toolkit for researchers

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

audiogpt

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audioldm2

Text-to-Audio/Music Generation

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

audiomae-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

audioset_tagging_cnn

awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch

awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

awesome-llm

Awesome-LLM: a curated list of Large Language Model

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

bert

TensorFlow code and pre-trained models for BERT

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

chattts

A generative speech model for daily dialogue.

chinese_text_normalization

Chinese text normalization for speech processing

comprehensive-e2e-tts

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

conv-tasnet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

convnet-benchmarks

Easy benchmarking of all publicly accessible implementations of convnets with pytorch support

lizezheng Goto Github PK

WhiteJunior2's Projects

Recommend Projects

Recommend Topics

Recommend Org