
Bruce's Projects

maskspec

The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

mastersthesis

This is a repository for Leo Kristopher Piel's Master's thesis. It contains code for the models built and the experiments conducted.

mtl-speaker-embeddings

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" accepted at Interspeech 2021

nemo

NeMo: a toolkit for conversational AI

passt

Efficient Training of Audio Transformers with Patchout

podcastmix

PodcastMix: a dataset for separating music and speech in podcasts.

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

pytorch-ivectors

GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An example script is provided for VoxCeleb data.

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

rnn_ctc

Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification, implemented in Theano. Includes a toy training example.

sms-tools

Sound analysis/synthesis tools for music applications

speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speakerembeddinglosscomparison

Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020

speakerprofiling

Estimating the age, height, and gender of a speaker from their speech signal.

speech-transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

starganv2-vc

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

tfgan-plc

A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission

voxsrc2020

Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020

w2v2-speaker-few-samples

Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688

wespeaker

Production First and Production Ready Speaker Recognition Toolkit
