aidman Goto Github PK
Name: Bruce
Type: User
Name: Bruce
Type: User
Lingvo
Machine Learning Yearning book by 🅰️𝓷𝓭𝓻𝓮𝔀 🆖
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
This is a repository for Leo Kristopher Piel's Master's thesis. It contains code for the built models and conducted expoeriments
Experimenting Speaker Verification and Recognition with Mistral A.K.A Alize
Implement MLP from Scratch using Python
Sample code for MS Learn module
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" accepted at Interspeech 2021
NeMo: a toolkit for conversational AI
Efficient Training of Audio Transformers with Patchout
PodcastMix A dataset for separating music and speech in podcasts.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An example script is provided for VoxCeleb data.
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
Code and models to accompany "Energy Efficient SID for Low-Precision Networks"
Sound analysis/synthesis tools for music applications
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
Estimating the Age, Height, and Gender of a speaker with their speech signal.
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688
Production First and Production Ready Speaker Recognition Toolkit
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.