Comments (4)
I am thinking about adding a recipe for something conversational, to illustrate the need for/benefits of multiple supervision segments with actual acoustic context. I could prepare a recipe for this, e.g. on Switchboard? Not sure if there's a similar corpus that is also open (maybe GigaSpeech, possibly a subset of it).
from icefall.
Don't worry, we'll make it ;)
Do you think we should copy the code from snowfall and adjust it somehow or just work on the snowfall repo for the time being? I think given that we're trying to limit the scope, maybe it makes sense to just stick to snowfall for now. Do you envision any serious changes to how the code is organized there by the tutorial deadline?
I think we should take the opportunity to clean up interfaces a little and rework the directory structure a bit,
and get rid of any old versions of code that we don't want. But we don't need to make very fundamental changes, I think.