deepsphinx's People

Contributors

vagrawal

deepsphinx's Issues

Installing on Windows

Hello,

The closing response to issue #3 says that deepsphinx was written for Python 2.
TensorFlow, however, requires Python 3 when installing on Windows.

Has anyone tried to install deepsphinx on Windows?

May I also add that the backslashes in Windows paths may interfere
with the structure of the input lines.

Thanks,
Yuval
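
A minimal sketch of one way to sidestep the backslash concern raised above, assuming the paths are written into a text file that is later parsed line by line. The helper name and the example file name are hypothetical and are not part of deepsphinx. Python on Windows generally accepts forward slashes in file paths, so converting before writing the file is usually safe:

```python
from pathlib import PureWindowsPath


def to_forward_slashes(windows_path):
    """Rewrite a Windows-style path so that it uses forward slashes."""
    # PureWindowsPath parses backslash separators on any platform,
    # and as_posix() renders the same path with forward slashes.
    return PureWindowsPath(windows_path).as_posix()


# Hypothetical file name, for illustration only.
print(to_forward_slashes(r"data\train5\utt01.wav"))
```

Whether this is needed depends on how the input file is actually parsed, which the thread does not show.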

Cannot find the module "flags"

Hello!

Thank you for all the improvements and documentation.

I installed on Ubuntu with:
sudo python3 setup.py build
sudo python3 setup.py install

then issued the command:

shell: python3 bin/deepsphinx-infer --checkpoint-path data/batch-21937.data-00000-of-00001 --audio-file ~/speech/23-00-25_217c.wav

There was an error - see below:

Traceback (most recent call last):
  File "bin/deepsphinx-infer", line 5, in <module>
    from deepsphinx.api import Predict
  File "/usr/local/lib/python3.5/dist-packages/deepshinx-0.1-py3.5.egg/deepsphinx/api.py", line 7, in <module>
ImportError: No module named 'flags'

The module is, however, present in the "build" directory:

shell: find . -name flags*
./build/lib/deepsphinx/flags.py
./deepsphinx/flags.py

The same error also occurs with python3 bin/deepsphinx-infer --help, and on Windows.

Thanks,
Yuval
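
The traceback above is consistent with a Python 2 style implicit relative import (a bare "import flags" inside the package), which Python 3 no longer resolves against the package directory. Line 7 of api.py is not shown in the report, so this is an assumption. The sketch below builds a throwaway package (demo_pkg, a hypothetical name) to show the explicit form that works on Python 3:

```python
import importlib
import os
import sys
import tempfile


def build_demo_package(root):
    """Create a throwaway package 'demo_pkg' containing flags.py and api.py."""
    pkg = os.path.join(root, "demo_pkg")
    os.makedirs(pkg)
    with open(os.path.join(pkg, "__init__.py"), "w") as f:
        f.write("")
    with open(os.path.join(pkg, "flags.py"), "w") as f:
        f.write("BATCH_SIZE = 8\n")
    with open(os.path.join(pkg, "api.py"), "w") as f:
        # Explicit relative import. A bare "import flags" here would make
        # Python 3 look for a top-level module named flags and fail.
        f.write("from . import flags\nVALUE = flags.BATCH_SIZE\n")
    return pkg


def load_demo_api():
    """Import demo_pkg.api from a temporary directory."""
    root = tempfile.mkdtemp()
    build_demo_package(root)
    sys.path.insert(0, root)
    importlib.invalidate_caches()
    try:
        return importlib.import_module("demo_pkg.api")
    finally:
        sys.path.remove(root)


api = load_demo_api()
print(api.VALUE)
```

If the cause is the same in deepsphinx, the equivalent change would be to import flags relative to the package or through the package name.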

Examples of script invocations for training and inference?

Hello!

Thanks for making the project compatible with Python 3!

In setup.py, the script name should be changed from 'bin/deepsphinx_train' to
'bin/deepsphinx-train'.

Could you please provide basic examples of script invocations for training and inference?

I ran
python bin\deepsphinx-train --trans-file data\ds-input.txt --nouse-train-lm --job-dir data\train5
and got a division by zero; something was probably wrong in my call.

Thanks,
Yuval

INFO:tensorflow:Getting speaker stats
INFO:tensorflow:Starting training
INFO:tensorflow:Epoch completed, saving
INFO:tensorflow:Evaluation started
INFO:tensorflow:Restoring parameters from data\elta\train55\checkpoints/-0
Traceback (most recent call last):
  File "bin\deepsphinx-train", line 270, in <module>
    tf.app.run(train)
  File "C:\Users\user\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\platform\app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "bin\deepsphinx-train", line 153, in train
    lm_fst)
  File "bin\deepsphinx-train", line 79, in run_eval
    tot_wer / tot_ev, tot_cer / tot_ev))
ZeroDivisionError: float division by zero

What is the preferred dataset for training?

Hi,
I have looked at your repository and would like to run some experiments with it. As you mentioned, I need to train the model before I can use it. If you have a well-trained model, please link it from your README file so that it is accessible to everyone.

Otherwise, please share the training data repository so that I can do the training myself. I found something in a TensorFlow speech recognition repository, where the datasets are 100 GB or 21 GB in size: https://github.com/pannous/tensorflow-speech-recognition#ultimate-goal
I would like to confirm whether this is the training data you used, or whether you used something else. If you did use it, how did you use it?
Kindly share your thoughts on this.

Error message when training

Hello!

I have trained on the same data from Windows and from Unix. The data contain 1293 wav files.
I did not create an FST or a language model. A checkpoint directory was created and I used it for
inference.

Surprisingly, when running inference on one of the wav files in the training data, the result was blank.
Could it be related to the lack of a language model?

Here is the Unix training run. The Windows training run gave the same messages (but with different files).

Thanks,
Yuval

shell:~/speech/deepsphinx$ python3 bin/deepsphinx-train --job-dir data --trans-file data/ds-input-unix.txt --nouse-train-lm --batch-size 1293

INFO:tensorflow:Getting speaker stats
INFO:tensorflow:Starting training
INFO:tensorflow:Epoch completed, saving
INFO:tensorflow:Evaluation started
INFO:tensorflow:Restoring parameters from data/checkpoints/batch-0
Traceback (most recent call last):
  File "bin/deepsphinx-train", line 248, in <module>
    tf.app.run(train)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/platform/app.py", line 48, in run
    _sys.exit(main(_sys.argv[:1] + flags_passthrough))
  File "bin/deepsphinx-train", line 154, in train
    lm_fst)
  File "bin/deepsphinx-train", line 80, in run_eval
    tot_wer / tot_ev, tot_cer / tot_ev))
ZeroDivisionError: float division by zero
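
The traceback shows that tot_ev was zero when run_eval formatted its summary, meaning no evaluation examples were counted. Why the count was zero is not visible in the log. The sketch below only illustrates guarding the averages so that an empty evaluation is reported instead of crashing. The variable names tot_wer, tot_cer and tot_ev come from the traceback; the two helper functions are hypothetical and are not deepsphinx code:

```python
def safe_average(total, count):
    """Return total / count, or None when nothing was counted."""
    if count == 0:
        return None
    return total / count


def format_eval_summary(tot_wer, tot_cer, tot_ev):
    """Format average WER and CER, tolerating an empty evaluation set."""
    wer = safe_average(tot_wer, tot_ev)
    cer = safe_average(tot_cer, tot_ev)
    if wer is None or cer is None:
        return "Evaluation skipped: no evaluation examples were processed"
    return "WER: {:.4f}, CER: {:.4f}".format(wer, cer)


print(format_eval_summary(0.0, 0.0, 0.0))
print(format_eval_summary(1.0, 0.5, 2.0))
```

A guard like this hides the symptom only. An empty evaluation set usually points to a problem in how the data were split or batched, which is worth checking first.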

Having issue while running the program.

Hello Vishal,
I have the following issue while running the program and I am not able to resolve it. Please help me:

>>> from deepsphinx.api import Predict
>>> ds = Predict(Predict.default_flags(),'batch-21937.data-00000-of-00001')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/aims-whm/deepsphinx-master/deepsphinx/api.py", line 24, in __init__
    1.0)
  File "/home/aims-whm/deepsphinx-master/deepsphinx/seq2seq_model.py", line 241, in seq2seq_model
    keep_prob)
  File "/home/aims-whm/deepsphinx-master/deepsphinx/seq2seq_model.py", line 51, in encoding_layer
    dtype=tf.float32)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/rnn.py", line 396, in bidirectional_dynamic_rnn
    seq_dim=time_dim, batch_dim=batch_dim)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/rnn.py", line 389, in _reverse
    seq_dim=seq_dim, batch_dim=batch_dim)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/array_ops.py", line 2355, in reverse_sequence
    name=name)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/ops/gen_array_ops.py", line 2633, in reverse_sequence
    batch_dim=batch_dim, name=name)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/op_def_library.py", line 589, in apply_op
    param_name=input_name)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/op_def_library.py", line 60, in _SatisfiesTypeConstraint
    ", ".join(dtypes.as_dtype(x).name for x in allowed_list)))
TypeError: Value passed to parameter 'seq_lengths' has DataType float64 not in list of allowed values: int32, int64
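
The TypeError says that the sequence lengths reached reverse_sequence as float64, while only int32 and int64 are accepted. One plausible cause, given that the project was originally written for Python 2, is a length computed with the / operator, which returns a float in Python 3. This is an assumption, since the code that computes the lengths is not shown. The function and parameter names below are hypothetical and only illustrate the difference:

```python
def frame_count_true_division(num_samples, hop):
    """Python 3 true division: the result is always a float."""
    return num_samples / hop


def frame_count_floor_division(num_samples, hop):
    """Floor division keeps an integer result for integer inputs."""
    return num_samples // hop


as_float = frame_count_true_division(16000, 160)
as_int = frame_count_floor_division(16000, 160)
print(type(as_float).__name__, type(as_int).__name__)
```

If the lengths are built this way, switching to floor division, or casting them to an integer dtype before they are passed to bidirectional_dynamic_rnn, should satisfy the type constraint.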
