hshi-speech Goto Github PK

followers: 46.0 following: 20.0 repos: 30.0 gists: 0.0

Name: Hao SHI (Fumi)

Type: User

Company: Kyoto University

Bio: A speech processing beginner.

Location: Kyoto, Japan

Hao SHI (Fumi)'s Projects

awesome-bandwidth-extension

This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.

awesome-speech-enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

cdiffuse

Conditional Diffusion Probabilistic Model for Speech Enhancement

chime4-ser-conv-tasnet-speech-enhancement

This repository is developed for speech enhancement based on Conv-TasNet using ESPNet framework.

crn-multi-resolution

cross-scale-non-local-attention

PyTorch code for our paper "Image Super-Resolution with Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining" (CVPR2020).

crossattentioncontrol

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

dccrn-with-various-loss-functions

DCCRN with various loss functions

demucs-1

Code for the paper Hybrid Spectrogram and Waveform Source Separation

dereverberation-toolkit-for-reverb-challenge

Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation

dns-challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

hshi-speech.github.io

icassp2023

model descriptions

improving-voice-separation-by-incorporating-end-to-end-speech-recognition

Implementing the paper -

interspeech-2023

kaldi_needs

librimix-repo

LibriMix-repo

mos_evaluation

my-dns-old-data

previous created dataset

my_feature_extraction

I have add some feature extraction function for torchaudio.

nomad

NOMAD is a fully unsupervised non-matching reference audio quality metric

one-example-of-feature-maps-of-spectrogram-decomposition

one example of spectrogram decomposition (feature map)

papers-repo-accents-asr

accents asr

research-and-analysis-of-speech-enhancement-or-dereverberation

This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further understanding. On the other hand, I hope that all beginners or masters interested in speech enhancement can ask me questions and make progress together. A lot of my summary is not very good, I hope you put forward corrections!

hshi-speech Goto Github PK

Hao SHI (Fumi)'s Projects

Recommend Projects

Recommend Topics

Recommend Org