goodatlas / zeroth
Kaldi-based Korean ASR (한국어 음성인식, Korean speech recognition) open-source project
License: Apache License 2.0
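Hello. I was looking for a Korean speech recognizer and am glad to have found such a good repo.
I have cloned it, finished the setup, and am now trying to run a test, but I get an error partway through.
I suspect one of the files is broken. Could you take a look?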
Checking data/lang_nosp/oov.{txt, int} ...
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> 1 entry/entries in data/lang_nosp/oov.txt
--> data/lang_nosp/oov.int corresponds to data/lang_nosp/oov.txt
--> data/lang_nosp/oov.{txt, int} are OK
--> data/lang_nosp/L.fst is olabel sorted
--> data/lang_nosp/L_disambig.fst is olabel sorted
--> SUCCESS [validating lang directory data/lang_nosp]
gzip: data/local/lm/zeroth.lm.tgsmall.arpa.gz: unexpected end of file
arpa2fst --disambig-symbol=#0 --read-symbol-table=data/lang_nosp_test_tgsmall/words.txt - data/lang_nosp_test_tgsmall/G.fst
ERROR (arpa2fst[5.5.433-7637d]:Read():arpa-file-parser.cc:127) line 1 []: \data\ section missing or empty.
[ Stack-Trace: ]
/home/slothjuns/kaldi/src/lib/libkaldi-base.so(kaldi::MessageLogger::LogMessage() const+0xb42) [0x7f70e145b692]
arpa2fst(kaldi::MessageLogger::LogAndThrow::operator=(kaldi::MessageLogger const&)+0x21) [0x55d613cc54cf]
/home/slothjuns/kaldi/src/lib/libkaldi-lm.so(kaldi::ArpaFileParser::Read(std::istream&)+0xa8f) [0x7f70e18b310f]
arpa2fst(main+0xd24) [0x55d613cc414e]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7) [0x7f70e037ab97]
arpa2fst(_start+0x2a) [0x55d613cc334a]
kaldi::KaldiFatalError
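The `gzip: ... unexpected end of file` line in the log above suggests the compressed LM is truncated, which would leave `arpa2fst` with empty input and explain the missing `\data\` section. A minimal sketch for checking the archive before rerunning the recipe follows; `check_arpa_gz` is a hypothetical helper and is not part of the zeroth scripts.

```shell
# check_arpa_gz FILE: report whether a gzip-compressed ARPA LM looks usable.
# Hypothetical helper for illustration; not part of the zeroth recipe.
check_arpa_gz() {
  local lm="$1"
  if [ ! -f "$lm" ]; then
    echo "missing: $lm"
    return 1
  fi
  # gzip -t tests archive integrity without writing output;
  # it fails on a truncated download.
  if ! gzip -t "$lm" 2>/dev/null; then
    echo "corrupt or truncated: $lm"
    return 1
  fi
  # An ARPA file must contain a line that is exactly \data\.
  if ! gzip -cd "$lm" | grep -q -x -F '\data\'; then
    echo "no data header: $lm"
    return 1
  fi
  echo "ok: $lm"
}
```

For example, `check_arpa_gz data/local/lm/zeroth.lm.tgsmall.arpa.gz` could be run from the recipe directory. If it reports a truncated file, deleting the file and downloading or rebuilding it again is the likely next step.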
The zeroth-LM depends on OpenJDK 7; however, openjdk-7-jdk is no longer officially supported, so the installation fails.
Any advice?
Thanks!
Refer to site URL: https://github.com/goodatlas/zeroth/wiki/Requirement-2
I downloaded the app and tried to record, but the authentication step does not go through.
I ran aws s3 cp s3://zeroth-opensource/AUDIO_INFO AUDIO_INFO with my own account, but I get the following error:
Traceback (most recent call last):
File "/home/kaldi/python3/lib/python3.7/site-packages/awscli/customizations/s3/s3handler.py", line 173, in call
for fileinfo in fileinfos:
File "/home/kaldi/python3/lib/python3.7/site-packages/awscli/customizations/s3/fileinfobuilder.py", line 31, in call
for file_base in files:
File "/home/kaldi/python3/lib/python3.7/site-packages/awscli/customizations/s3/filegenerator.py", line 142, in call
for src_path, extra_information in file_iterator:
File "/home/kaldi/python3/lib/python3.7/site-packages/awscli/customizations/s3/filegenerator.py", line 318, in list_objects
yield self._list_single_object(s3_path)
File "/home/kaldi/python3/lib/python3.7/site-packages/awscli/customizations/s3/filegenerator.py", line 355, in _list_single_object
response = self._client.head_object(**params)
File "/home/kaldi/python3/lib/python3.7/site-packages/botocore/client.py", line 386, in _api_call
return self._make_api_call(operation_name, kwargs)
File "/home/kaldi/python3/lib/python3.7/site-packages/botocore/client.py", line 705, in _make_api_call
raise error_class(parsed_response, operation_name)
botocore.exceptions.ClientError: An error occurred (403) when calling the HeadObject operation: Forbidden
fatal error: An error occurred (403) when calling the HeadObject operation: Forbidden
2021-07-22 16:40:56,020 - Thread-1 - awscli.customizations.s3.results - DEBUG - Shutdown request received in result processing thread, shutting down result thread.
Download from AWS is failed, check your credential and configure your aws CLI
Can you help me?
AUDIOINFO='AUDIO_INFO'
AUDIOLIST=$2
bucketname="zeroth-opensource"

# download audio info file
if [ ! -f "$data/$AUDIOINFO" ]; then
    aws s3 cp "s3://$bucketname/$AUDIOINFO" "$data/$AUDIOINFO"
    success=$?
    if [ $success -ne 0 ]; then
        echo "Download from AWS is failed, check your credential and configure your aws CLI"
        exit 1
    fi
fi

# download Audio
echo "Now download Audio ----------------------------------------------------"
for file in $AUDIOLIST
do
    echo "check if $file.tar.gz exist or not"
    if [ ! -f "$data/$file.tar.gz" ]; then
        aws s3 cp "s3://$bucketname/$file.tar.gz" "$data/$file.tar.gz"
    else
        echo " $data/$file.tar.gz already exist"
    fi
done
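Note that the script above checks the exit status only for the AUDIO_INFO download; the per-file loop does not, so a 403 on a tarball would pass silently. A minimal sketch of a helper that applies the same check to every download is shown here. `fetch_if_missing` and the `S3_CP` override are hypothetical names introduced for illustration (the override exists only so the sketch can be tried without AWS access); they are not part of the zeroth scripts.

```shell
# fetch_if_missing SRC DEST: copy SRC to DEST unless DEST already exists,
# and fail loudly when the copy command returns a non-zero status.
# S3_CP is a hypothetical override; by default the helper runs "aws s3 cp".
fetch_if_missing() {
  local src="$1" dest="$2"
  if [ -f "$dest" ]; then
    echo "$dest already exists"
    return 0
  fi
  # Left unquoted on purpose so that "aws s3 cp" splits into three words.
  if ! ${S3_CP:-aws s3 cp} "$src" "$dest"; then
    echo "Download of $src failed; check your credentials and AWS CLI configuration" >&2
    return 1
  fi
}
```

Inside the loop, a call such as `fetch_if_missing "s3://$bucketname/$file.tar.gz" "$data/$file.tar.gz" || exit 1` would then stop at the first failed download.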
Hello. MoaCoin (모아코인) does not show up on iOS.
I would like to get the data. Is there any way I can obtain it?
Thank you.
In run_openslr.sh, change:
local/download_from_openslr.sh -> local/download_from_openslr.sh $data
In download_from_openslr.sh, change:
speechData -> $1
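One way to make that change while keeping the old behavior when no argument is given is a default parameter expansion. This is only a sketch: `resolve_data_dir` is a hypothetical helper, and it assumes `speechData` was the previously hard-coded directory name, as the change above implies.

```shell
# resolve_data_dir [DIR]: print the download directory.
# Uses the first argument when given, otherwise falls back to "speechData".
# Hypothetical helper for illustration; not part of download_from_openslr.sh.
resolve_data_dir() {
  echo "${1:-speechData}"
}
```

Inside the script, the equivalent one-liner would be `data="${1:-speechData}"`.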
Hi,
Is there a pre-trained Korean model available for testing?
best regards,
Luke
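Hello. This is Keum Min-seok from Korea University.
Thank you again for your major contribution to Korean speech recognition.
I have cloned from GitHub before, but this is my first time posting an issue. ^^
I have not run the acoustic model training yet. For now, using the results listed at https://github.com/goodatlas/zeroth/blob/master/s5/RESULTS as a reference, I wanted to evaluate the test_data_01 data downloaded from AWS with the speech recognizer I use in my research and compare the numbers.
The accuracy came out very low, so I inspected the audio, and I think the cause may be a large difference in acoustic characteristics from the ETRI speech data I trained on. Was any separate speech enhancement applied to the data? It may also be because my training data was not processed with RIRs.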
Assuming the input text is UTF-8 encoded, I think the following line needs to be changed:
zeroth/s5/local/updateSegmentation.sh
Line 30 in 3cbb45c
from:
morfessor -l $lmDir/zeroth_morfessor.seg -T - -o - \
to:
morfessor -e 'utf-8' -l $lmDir/zeroth_morfessor.seg -T - -o - \
This is because morfessor defaults to the assumption that the text is ASCII.
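Since the fix rests on the assumption that the input is UTF-8, it may be worth verifying that first. A minimal sketch using `iconv`, which exits with a non-zero status when it meets a byte sequence that is invalid in the source encoding; `is_utf8` is a hypothetical helper name.

```shell
# is_utf8 FILE: succeed if FILE decodes cleanly as UTF-8.
# Hypothetical helper for illustration; converting UTF-8 to UTF-8 makes
# iconv act as a validator and fail on invalid byte sequences.
is_utf8() {
  iconv -f UTF-8 -t UTF-8 "$1" >/dev/null 2>&1
}
```

For example, `is_utf8 corpus.txt || echo "corpus.txt is not valid UTF-8"` (with `corpus.txt` standing in for the actual input file) flags a file that should be converted before it is passed to morfessor.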
NOTE: I'm referring to the RESULTS file in the current Kaldi commit, not the one in goodatlas/zeroth.
Hi, I ran the provided zeroth_korean recipe in Kaldi without changing any of the scripts. When I looked at the evaluation output, the test set contained only 6641 "words" (individual tokens separated by spaces), whereas the RESULTS file reports 9253 words. What might explain the discrepancy with the data I'm processing?
One more detail about my setup: when I reached the common nnet3 procedures for i-vector extraction, I first tried turning off speed perturbation. At the step where the training data is subset to 30,000 utterances, it failed because the training set (train_clean) has only 22263 files. Maybe there's something I'm missing?
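To compare counts like these against a RESULTS file, the tokens and utterances can be counted directly from a Kaldi `text` file, where each line is an utterance ID followed by the transcript. A minimal sketch; `count_words` and `count_utts` are hypothetical helper names.

```shell
# count_words FILE: count transcript tokens in a Kaldi "text" file,
# skipping the first field of each line (the utterance ID).
count_words() {
  awk 'NF > 0 { n += NF - 1 } END { print n + 0 }' "$1"
}

# count_utts FILE: count utterances (one per line).
count_utts() {
  wc -l < "$1" | tr -d ' '
}
```

For example, `count_words data/test_clean/text` and `count_utts data/train_clean/text` would give the figures to compare, assuming those are the directories the recipe created.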
I want to download the raw dataset through Morecoin.
However, for a technical reason I cannot sign up for Morecoin.
Is there anything I can do about this?
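Hello.
I tried to record speech with the MoaCoin (모아코인) app,
but it shows "No open campaign found" and will not proceed any further.
Could you please look into this?
If there is another way to obtain the latest speech data, I would appreciate it if you could share it.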
closed
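Hello.
Thank you for the great resource.
After seeing the Facebook post, I was looking at how you built the G2P and the LM,
and I have a question about Morfessor.
I happened to come across a study on German that compares G2P at the morpheme level with G2P at the word level.
https://nats-www.informatik.uni-hamburg.de/pub/SLP16/WebHome/POSTER-g2p.pdf
It reports that word-level G2P performs better,
so I became very curious how the comparison would turn out for Korean.
Do you have any experience or research results comparing the two?
Regards, Kwon Ho-min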