aliang-voice Goto Github PK

Type: Organization

aliang-voice's Projects

moetts

Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan and VITS

multilingual_text_to_speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

musicldm

The latent diffusion model for text-to-music generation.

openvoice

Instant voice cloning by MyShell.

pansori-tedxkr-corpus

Korean ASR Corpus generated from TEDx talks

quickvc-voiceconversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

speech-synthesis-paper

List of speech synthesis papers.

spleeter

Deezer source separation library including pretrained models.

stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

stable-audio-tools

Generative models for conditional audio generation

stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

ted_dataset

this is the repo for TED dataset

translation-starter

translation-starter是一个开源项目，它允许你很快地部署一个应用程序，这个应用可以将任何视频翻译成任何语言，并通过AI技术实现口型与声音的完美同步。如果你需要快速集成视频翻译、声音克隆和口型同步到你的业务或流程中，这个工具可以在15分钟内帮助你搭建起来。

unipunc

The case study and multilingfual performance of ICASSP submission

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

vits_chinese

vits chinese, tts chinese, tts mandarin 史上训练最简单，音质最好的语音合成系统

voice-cloning-create-dataset

Create your own RVC v2 dataset from a youtube video

voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

voiceme

Repository for the paper: VoiceMe: Personalized voice generation in TTS

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

voistock_voice_get

voistock站点voicelist页面免费音源检索并下载程序

voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

aliang-voice Goto Github PK

aliang-voice's Projects

Recommend Projects

Recommend Topics

Recommend Org