forwiat Goto Github PK

followers: 2 following: 9 repos: 151 gists: 2

Type: User

forwiat's Projects

stargan-voice-conversion

full tensorflow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks https://arxiv.org/abs/1806.02169

stargan-voice-conversion-1

This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks

styler

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

stylespeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

stylespeech-1

Official implementation of Meta-StyleSpeech and StyleSpeech

styletts2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

tensorflow2.0_resnet

A ResNet(ResNet18, ResNet34, ResNet50, ResNet101, ResNet152) implementation using TensorFlow-2.0.

text-independent-speaker-verification

Text Independent Speaker Verification Using GE2E Loss

tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

thread_music

Implementation of music type recognition in TensorFlow. It can also be regarded as a multi-thread queue dataset module.

tiny_yolo_v3

torchsubband

Pytorch implementation of subband decomposition

trainer

🐸 - A general purpose model trainer, as flexible as it gets

tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

tts-frontend-dataset

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

utmos22

UT-Sarulab MOS prediction system using SSL models

vae

A simple VAE and CVAE in Keras

vae_tacotron

Implementation of a speech synthesis project based on VAE Tacotron.

vampnet

music generation with masked transformers!

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

viscpm

Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

visqol

Perceptual Quality Estimator for speech and audio

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).

voxceleb_trainer

In defence of metric learning for speaker recognition

vq-ppg-vc

Vector Quantized PPGs based Voice conversion

waveglow-tensorflow

Tensorflow implementation of Nvidia Waveglow

wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

xphonebert

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
