Xianke Wang's Projects
Mirror of althttpd repo, the small, simple HTTP server from sqlite https://sqlite.org/althttpd/doc/trunk/althttpd.md
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
An Open Source Tools for Speaker Recognition
A PyTorch implementation of the Transformer model in "Attention is All You Need".
audioNet for piano transcription
A curated list of different papers and datasets in various areas of audio-visual processing
A curated list of resources for Image and Video Deblurring
课程视频、PPT和源代码:侯捷C++系列;台大郭彦甫MATLAB
a repository for sharing book
This is a repository for the comments of my blog
A C++ library and Vamp plugin implementing the Constant-Q transform of a time-domain signal.
CPP实现的CQT(与Python librosa结果相近)和一个唱歌onset detection模型
Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.
Convolutional recurrent network in pytorch
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
后台开发基础知识总结(春招/秋招)
Phase-Aware Speech Enhancement with Deep Complex U-Net
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
My solution to course E6870 (Speech Recognition) of Columbia University.
This is my translation of Chinese document of Eigen