khaotik / danet-tensorflow Goto Github PK

View Code? Open in Web Editor NEW

90.0 90.0 41.0 85 KB

Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"

License: MIT License

Python 96.73% Shell 3.27%

danet-tensorflow's People

Contributors

Stargazers

Watchers

danet-tensorflow's Issues

Constant traning loss ?

We are using this implementation for training a model using our custom dataset with the default network configuration. But it seems that the training loss is constant even after 1600 epochs.

What might be the possible cause ?
And how the loss is calculated ?

NB: We are using CPU machine .

how to download TIMIT and WSJ0 datasets?

Suggested that provide a download link of TIMIT and WSJ0 datasets,thank you!

ops.py syntax invalid

I can't run the main.py due to the error shown in the figure. Can you help me figure out how to solve this ,thank very much~

the model cannot separate the mix speech!

Hello, I am disturbing you. I used my timit dataset to find that the trained model could not separate mixed speech. May I venture to ask, do you have a trained model for reference? Thank you very much!

Training problems

Thank you for your nicely implemented DaNet!

However, I ran into a couple of questions when testing your code. Would you please kindly help me figure them out?

After installing the TIMIT dataset, I ran the timit_1.sh script, but the result using the demo drown from the test set seemed not very good. The model I used is anchor and bilstm-orig. So I guess the timit_1.sh is not meant to be used in such settings?
When you read raw files from timit using scipy.io.wavfile, the format is 16-bit PCM., If you cast the data type into float type and do some processing and then write back the wav, the scipy.io.wave will see that as a 32-bit floating-point type and most of the data will blow the file up(bigger than one). It seems that there will also be a mismatch between training and testing using other format of wavefiles because of the format problems (wavefile.read also has such problems). Not sure if the problem is affected by scipy versions, I've tested them both on scipy 0.9 and 1.0.

As far as I can see, you implementation looks different from the original paper in the following ways

Your input data to the encoder are of variant time steps, which depend on the length of the raw signals. The original paper use trunks of frames of length 100, much shorter than the typical input lengths in your implementation, that might help LSTM to remember things better.
Your data generator may mix up the signals from the same speaker, that might potentially undermine the network to separate the signal based on the tones of the speakers.
The embedding encoder of the original paper has a tanh activation function before spitting out the embedding vectors and your implementation is a linear activation function.

Your implementation really helps me a lot. Looking forward to your reply!!!

Loss cannot decrease

Thanks very much for your code! That really help my a lot. But when I use my dataset, I find that my batch loss cannot decrease. And the demo cannot separate even a little.
Hope you can reply me! Thanks much for your time!

SNR Target Value

I tried with TIMIT and with my own dataset and it works fine (some modifications to do for TIMIT on OSX). At the moment there is no convincing result after a hundred epoch. What is the value of SNR validation to get to have good results?

NameError: global name 'FileNotFoundError' is not defined

I make sure download and install the timit dataset

File "app/datasets/timit.py", line 102, in install_and_load
raise FileNotFoundError(

NameError: global name 'FileNotFoundError' is not defined

IOError: [Errno 2] No such file or directory: 'app/datasets/TIMIT/train_set.pkl'

just some questions

Hello, it's very nice to see such good programs. I am the beginner and I am trying to do the same experiments in this paper, too. But when I run your programs there are some problems. I want to use timit datasets, firstly, but when I run as the 'readme' said, the timit datasets does not install at all. I don't know what operations are wrong. Could you please help me? If it's possible could you please give an e-mail address to me? Thank you so much~

khaotik / danet-tensorflow Goto Github PK

danet-tensorflow's People

Contributors

Stargazers

Watchers

Forkers

danet-tensorflow's Issues

Roadmap

Constant traning loss ?

NB: We are using CPU machine .

how to download TIMIT and WSJ0 datasets?

ops.py syntax invalid

the model cannot separate the mix speech!

Training problems

Loss cannot decrease

SNR Target Value

NameError: global name 'FileNotFoundError' is not defined

just some questions

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent