kunzhou9646
Name: Kun Zhou
Type: User
Company: Human Language Technology Lab, NUS
Bio: PhD student at the National University of Singapore (NUS).
Twitter: KunZhou65685140
Location: Singapore
*BeaqleJS* provides a framework for creating browser-based listening tests and is built purely on open web standards such as HTML5 and JavaScript.
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
This is a sample website for ICASSP 2021.
This is the code for a controllable emotional voice conversion (EVC) framework for seen and unseen emotion generation.
This is the demo page.
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
This is the demo of the emotion triangle.
This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
This is the demo page.
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
REST service to call the Festival text to speech application
This is the demo page.
Efficient neural speech synthesis
This is the demo page of the paper "Speech Synthesis with Mixed Emotions".
This is the demo page.
Implementation code of non-parallel sequence-to-sequence VC
Parallel-data-free emotional voice conversion with CycleGAN and Continuous Wavelet Transform
Transforming spectrum and prosody for emotional voice conversion with non-parallel training data
Paragraph-by-paragraph close reading of classic and new deep learning papers
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
PyTorch Tutorial for Deep Learning Researchers
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage sequence-to-sequence training.