Git Product home page Git Product logo

aliang-voice's Projects

moetts icon moetts

Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan and VITS

multilingual_text_to_speech icon multilingual_text_to_speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

musicldm icon musicldm

The latent diffusion model for text-to-music generation.

quickvc-voiceconversion icon quickvc-voiceconversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

spleeter icon spleeter

Deezer source separation library including pretrained models.

stable-audio-metrics icon stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

stopes icon stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

translation-starter icon translation-starter

translation-starter是一个开源项目,它允许你很快地部署一个应用程序,这个应用可以将任何视频翻译成任何语言,并通过AI技术实现口型与声音的完美同步。如果你需要快速集成视频翻译、声音克隆和口型同步到你的业务或流程中,这个工具可以在15分钟内帮助你搭建起来。

unipunc icon unipunc

The case study and multilingfual performance of ICASSP submission

vall-e icon vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

vits icon vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

vits_chinese icon vits_chinese

vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统

voiceme icon voiceme

Repository for the paper: VoiceMe: Personalized voice generation in TTS

voicesmith icon voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

voxpopuli icon voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

whisper icon whisper

Robust Speech Recognition via Large-Scale Weak Supervision

zeroth icon zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.