dongsig Goto Github PK
Name: dyang
Type: User
Company: Tencent
Bio: Speech
Location: Shanghai
Name: dyang
Type: User
Company: Tencent
Bio: Speech
Location: Shanghai
Open-source implementation of Google Vizier for hyper parameters tuning
AEC Challenge
Implementation of the paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
Automatic LInguistic Unit Count Estimator (ALICE)
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture https://arxiv.org/abs/2010.15006
Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge
Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.
Audio Masking Methods
Transferring audio features to build models for rare conditions with scarce data
A simple header-only C++ library for reading and writing audio files.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
A data augmentations library for audio, image, text, and video.
A web application that uses artificial intelligence to automatically label voice datasets with the age of the speaker.
AutoGluon: AutoML Toolkit for Deep Learning
Code to reproduce the results in the paper "Surrogate Source Model Learning for Determined Source Separation"
A curated list of awesome Deepfakes materials
This is the dataset set and code of paper which name is Research of Infant Crying Detection Method Based on Audio and Video Fusion
A Python implementation of global optimization with gaussian processes.
vits2 backbone with bert
Build speech enhancement dataset.
Implementation of Learning Bandwidth Expansion Using Perceptually-Motivated Loss (ICASSP 2019)
Implementation of the CGMM-MVDR beamforming
Complex domain recurrent neural network gating and Stiefel-manifold optimization in TensorFlow, NeurIPS 2018
Collection of scripts to create a dataset of noisy multi-channel reverberant mixtures based on wsj1 and CHiME3 datasets.
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Detect audio deep fakes with bispectral analysis
A python module for making pandas datasets out of drum libraries, and training drum type classification models using a few different methods.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.