ishine's Projects
An efficient architecture for real-time target sound extraction
C++ Code to run waveglow inference in cuda
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
A fast, high-quality neural vocoder.
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Unofficial Pytorch Implementation of WaveGrad2
WaveGRU vocoder
WaveRNN Vocoder + TTS
A WaveRNN implementation
Code base for WaveTransformer: A novel architecture for automated audio captioning
End-to-End binaural sound localization
This repo contains the implementation of the Wasserstein Barycenter Transport proposed in "Wasserstein Barycenter Transport for Acoustic Adaptation" at ICASSP/2021 (to appear)
Chinese word segmentation model with word-based character embeddings.
WDASnet - Weighted-Directional-Aware Speech Separation Network
a MUSHRA compliant web audio API based experiment software
微信爬虫,获取文章内容、阅读量、点赞量、评论等,获取公众号所有历史文章链接。
高效微信公众号历史文章和阅读数据爬虫powered by scrapy
Production First and Production Ready End-to-End Speech Recognition Toolkit
wenet runtime binding
Production First and Production Ready End-to-End Keyword Spotting Toolkit
wenet模型学习
an applycation of nnlm for wenet