Light

montrealcorpustools / kalpy Goto Github PK

View Code? Open in Web Editor NEW

7.0 2.0 0.0 89.14 MB

Pybind11 bindings for Kaldi

License: MIT License

CMake 0.29% C++ 80.44% C 0.90% Python 18.37%

kalpy's Introduction

kalpy

Pybind11 bindings for Kaldi for use in Montreal Forced Aligner.

Installation

Kalpy depends on Kaldi being built as shared libraries, and the easiest way to install is via conda-forge:

conda install -c conda-forge kalpy

Kalpy is also available on pip via the kalpy-kaldi package, but as this is only a binding library, it relies on Kaldi shared libraries being available. The KALDI_ROOT environment variable must be set to locate the shared libraries and header files. The easiest way to install the appropriately built kaldi libraries is via conda install -c conda-forge kaldi.

export KALDI_ROOT=/path/to/conda/enviornment
pip install kalpy-kaldi

Usage

Two libraries are installed, _kalpy which contains low level bindings conforming to the original C++ style, and kalpy which is a more pythonic interface for higher level operations. The kalpy package is under heavy development and expansion to expose more functionality.

kalpy's People

Contributors

Stargazers

Watchers

kalpy's Issues

where can I find _pywrapfst.h ?

Hi, I failed to install kalpy by conda install kalpy somehow
then I use python3 setup install at kalpy root dir, after install kaldi toolkit successfully,
but got Error :can't find file _pywrapfst.h,
I checked https://github.com/conda-forge/pynini-feedstock and https://github.com/kylebgorman/pynini/tree/master/extensions
where can I find this file ? thank you !

[BUG] Error during alignment: RuntimeError: kaldi::KaldiFatalError

First of all, thank you for putting in so much work into this wonderful solution. It makes life easier being able to use Python to interface with Kaldi instead of having to string together commands all the time - this is nice!

Debugging checklist

[X --> v0.6.0] Have you updated to latest Kalpy version?

Describe the issue
During an attempt to make a simple function to align text to audio without any temp files (using GmmAligner.align_utterance), I encountered kaldi::KaldiFatalError during alignment, and couldn't trace the issue further.

For Reproducing your issue
All files involved in my testing are in the Drive folder here:

https://drive.google.com/drive/folders/148-z_-i_7U09QY8HgpUoeAE5_84U0zex?usp=sharing
The folder kalpy_testing/gh_raw_files is a copy paste from this repository's testing/data folder.
The Folder kalpy_testing/acoustic_model is the extracted acoustic model files for English (US) ARPA acoustic model v2.0.0
The Folder kalpy_testing/acoustic_model_mfa is the extracted acoustic model files for English MFA acoustic model v2.2.1
The test.py file is the python script I wrote, which you can run to reproduce the errors.

Extract the entire folder somewhere and make sure the file structure is the same as in the Drive folder.

Environment:
Run conda install -c conda-forge kalpy if you don't already have. (I used v0.6.0, since it is the latest version on conda-forge).

Run python test.py

This should reproduce the outputs:

Running custom forced alignment with no temp files...
Initializing lexicon...
Initializing aligner...
Initializing feature generator...
Performing forced alignment...
Computing features...
Compiling training graph...
Performing alignment...
Traceback (most recent call last):
  File "/app/test.py", line 242, in <module>
    align_one(input_wav, input_align_text, feature_generator, training_graph_compiler, aligner)
  File "/app/test.py", line 155, in align_one
    alignment = aligner.align_utterance(
  File "/env/lib/python3.10/site-packages/kalpy/gmm/align.py", line 72, in align_utterance
    ) = gmm_align_compiled(
RuntimeError: kaldi::KaldiFatalError

Additional context
I ran all of this in a GPU enabled Docker container on Ubuntu with Python v3.10.

I was ALMOST able to run the test in the repository test_align_sat_first_pass using the data in the kalpy_testing/gh_raw_files/kaldi folder corresponding to tests/data/kaldi in the repo (this is in the python file in the Drive folder I provided). The alignment was generated, but LexiconCompiler.phones_to_pronunciations failed to run, dumping the following error:

ERROR: SymbolStringToLabels: Symbol "" is not mapped to any integer label in symbol table <unspecified>
ERROR: Failed to compile string `           be          be                                       `, with token_type: symbol

I was NOT able to run the test in the repository test_align, and got the same RuntimeError: kaldi::KaldiFatalError as in my custom function.

I tried with two different dictionary+acoustic combinations:

English MFA dictionary v2.2.1 + English MFA acoustic model v2.2.1
English (US) ARPA dictionary v2.0.0 + English (US) ARPA acoustic model v2.0.0

Again, thank you for your work. I have used MFA for the past several months in projects, and am excited to get kalpy to work for even better interfacing with kaldi!

Installation with pip without conda?

Can we install this without conda?
I was trying pip install kalpy-kaldi that didn't work, also tried to build this with pip3 install .
also python3 setup.py build which was most helpful since it complained about not seeing headers from kaldi:

/opt/kalpy/extensions/chain/pybind_chain.h:6:10: fatal error: chain/chain-datastruct.h: No such file or directory
    6 | #include "chain/chain-datastruct.h"

despite having set

export KALDI_ROOT="/opt/kaldi" 
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:"$KALDI_ROOT/src/lib"

I'm not using conda because I want to build this inside docker for aarch64, so conda doesn't work very well...

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.