deryy
Name: zhengjun
Type: User
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable for evaluating any token-based alignment (e.g., where the token is a word, phrase, note, or section). Used for the evaluation of the MIREX Lyrics-to-audio challenge.
A music-driven system for generating apparel display video
A Deep-Learning-Based Chinese Speech Recognition System
Extract the melody from an audio file and export to MIDI
Notes on Scientific Computing for Biomechanics and Motor Control
A Chinese chatbot built with PyTorch, running on the NVIDIA TX1. The code is modified from https://github.com/SeanNaren/deepspeech.pytorch
DTW (Dynamic Time Warping) Python module
Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.
A simple chat UI, to be improved gradually
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
jQuery plugin for color manipulation and animation support.
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm first uses structural segmentation to split the audio into structural sections, then uses hidden Markov models to align the lyrics within each section. The final alignment for each song is the concatenation of the lyric timestamps from all sections.
Util code, issues, discussions
Music segmentation in the audio domain
A music segmentation algorithm that I proposed and implemented as my undergraduate project. A song is loaded into the system, which computes chroma (harmonic) and MFCC (timbre) features of the audio, finds segment labels using a similarity matrix, and outputs the time boundaries of the segments.
Paper keyword mining for ZJU.
Praat in Python, the Pythonic way
A simple example of extracting the pitch of a voice track using the Python library librosa. The script generates smoothed pitch graphs.
Python implementation of image classification based on GMM dictionaries and Fisher vectors.
A Python project for generating concatenative synthesis driven representations of audio files based on audio database analysis.
speech-aligner is a tool that generates phoneme-level time alignment annotations from human speech and its transcription.
An Open Source Machine Learning Framework for Everyone
Thesis Helper: a PDF reader that translates English text and displays the Chinese result in a sidebar