
hubeibei007's Projects

nmt

TensorFlow Neural Machine Translation Tutorial

panako

The Panako acoustic fingerprinting system.

pretrained-models.pytorch

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

pyacoustid

Python bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

pyannote-metrics

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

pyaudioanalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

pytorch-handbook

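pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to make sure they run successfully.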

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

pytorch-unet

Pytorch implementation of the U-Net for image semantic segmentation, with dense CRF post-processing

self-supervised-speech-pretraining-and-representation-learning

The S3PRL speech toolkit: self-supervised pre-training and representation learning of Mockingjay, TERA, A-ALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts including phone classification, speaker recognition, and ASR. (All in Pytorch!)

shopsign

The website of our Shop Sign Dataset (a large-scale collection of natural scene images with Chinese text)

tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

transformer-tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

tts

Deep learning for Text to Speech

vlfeat

An open library of computer vision algorithms
