Comments (6)
You might want to filter out some cuts that are too long. You likely have outliers with large duration.
from icefall.
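A minimal sketch of that kind of filtering, in plain Python. The `Cut` class here is a hypothetical stand-in that only carries an ID and a duration, and the 20-second threshold is an assumed value, not one taken from this thread. With a lhotse `CutSet`, the same predicate could be passed to `CutSet.filter`.

```python
from dataclasses import dataclass


@dataclass
class Cut:
    """Hypothetical stand-in for a cut: an ID plus a duration in seconds."""
    id: str
    duration: float


def remove_long_cuts(cuts, max_seconds=20.0):
    """Keep only cuts whose duration does not exceed max_seconds.

    max_seconds=20.0 is an assumed threshold; pick one that suits your data.
    """
    return [c for c in cuts if c.duration <= max_seconds]


cuts = [Cut("utt1", 4.2), Cut("utt2", 11.7), Cut("utt3", 93.5)]
kept = remove_long_cuts(cuts)
print([c.id for c in kept])  # the 93.5 s outlier is dropped
```

Inspecting the duration distribution of the training set first makes it easier to choose a threshold that removes only the outliers.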
Yeah, I got it.
I think if I set max-duration, the total duration in a batch will not exceed this value, right?
So maybe some transcripts in this batch are too long?
Yeah, something like that. I'd say the audio/features are too long.
BTW, the total duration of the supervised chunks won't exceed max-duration. That means the actual batch can be somewhat larger because of padding, but with BucketingSampler the amount of padding is negligible.
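To illustrate the padding point, here is a toy model, not lhotse's actual BucketingSampler. It assumes every cut in a batch is padded to the longest one, packs cuts greedily so the unpadded sum stays under `max_duration`, and compares packing in the original order with packing inside duration-sorted buckets. All function names and the example durations are made up for illustration.

```python
def padded_duration(batch):
    """Seconds actually processed when every cut is padded to the longest one."""
    return len(batch) * max(batch)


def make_batches(durations, max_duration):
    """Greedily pack cuts so the unpadded sum stays within max_duration."""
    batches, current = [], []
    for d in durations:
        if current and sum(current) + d > max_duration:
            batches.append(current)
            current = []
        current.append(d)
    if current:
        batches.append(current)
    return batches


def bucketed_batches(durations, max_duration, num_buckets):
    """Sort by duration, split into buckets, then pack within each bucket."""
    ordered = sorted(durations)
    size = -(-len(ordered) // num_buckets)  # ceiling division
    batches = []
    for i in range(0, len(ordered), size):
        batches.extend(make_batches(ordered[i:i + size], max_duration))
    return batches


durations = [2.0, 30.0, 3.0, 28.0, 2.5, 29.0]  # seconds, made-up values
max_duration = 40.0

plain = make_batches(durations, max_duration)
bucketed = bucketed_batches(durations, max_duration, num_buckets=2)

worst_plain = max(padded_duration(b) for b in plain)
worst_bucketed = max(padded_duration(b) for b in bucketed)
print(worst_plain, worst_bucketed)
```

In this toy example, mixing short and long cuts produces a batch whose padded size is well above `max_duration`, while grouping cuts of similar length keeps the padded size close to the unpadded sum.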
So for the same data, icefall will need more GPU memory than Kaldi, right?
If I remember correctly, Kaldi's training has a parameter, egs.chunk-width (such as 150,110,100), to control the frames per batch. I usually use minibatch=64, so the total duration per batch may be larger than 64 s. I have never seen an OOM, and the GPU usage is usually > 90%.
In general, any alignment-free training will require more memory due to padding. I think Kaldi was able to optimize its memory usage because the alignment gives you frame-level supervision, so you can train on chunks of utterances. For CTC or an attention decoder, on the other hand, you have to use the whole utterance.
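As a rough worked example of the chunked setup mentioned earlier in the thread: assuming 100 feature frames per second (a 10 ms frame shift, which is an assumption and not stated here) and that every chunk in a minibatch has the same width, the audio per batch is fixed and needs no padding. The helper name below is hypothetical.

```python
FRAMES_PER_SECOND = 100  # assumption: 10 ms frame shift


def chunked_batch_seconds(minibatch_size, chunk_width_frames):
    """Seconds of audio in a minibatch of equal-width chunks (no padding)."""
    return minibatch_size * chunk_width_frames / FRAMES_PER_SECOND


# Chunk widths and minibatch size quoted in the thread.
batch_seconds = {w: chunked_batch_seconds(64, w) for w in (150, 110, 100)}
for width, seconds in batch_seconds.items():
    print(f"chunk-width {width}: {seconds} s per minibatch")
```

Under these assumptions each batch holds between 64 s and 96 s of audio, but its shape is known in advance, which is not the case when whole utterances of varying length are padded together.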
Thanks, I'll close this issue.
By the way, for Docker users, --shm-size also needs to be set for parallel training.