A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting features from and manipulating audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc.).

preprocess

Corpus preprocessing

presumm

Code for the EMNLP 2019 paper "Text Summarization with Pretrained Encoders"

pretrain_speech_model

Speech Model Pre-training for End-to-End Spoken Language Understanding

prism-set

Automatically exported from code.google.com/p/prism-set

prob-phoc

Probabilistic PHOC

probabilistic-face-embeddings

Uncertainty-aware Face Representation and Recognition

probabilistic_embeddings

This repository [contains / will contain] Python code associated with our Odyssey paper [put arXiv link here].

prodiff

PyTorch implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline

project-currennt-public

CURRENNT code and scripts

project-currennt-scripts

This repository contains the scripts to use CURRENNT

pronouncur

PronouncUR: An Urdu Pronunciation Lexicon Generator

propbank-frames

Lexicon of frame files used by Propbank annotation. A searchable, readable version of these files is stored at http://verbs.colorado.edu/propbank/framesets-english-aliases/

propbank-release

The official released annotations, both in .prop pointer format and as CoNLL files. Does not contain the source texts.

prosodic-lid-globalphone

MInf project exploring the use of prosodic information in language identification from speech, using the x-vector architecture in Kaldi, on the GlobalPhone dataset.

prosody

Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text

proteno

This repository contains data used in the NAACL 2021 paper "Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems" (https://arxiv.org/abs/2104.07777)

Ewald Enzinger's Projects
