Comments (1)
Hi @joyolee.
Sorry for the late reply!
My team and I recently added the training and decoding scripts, so feel free to check them. All hyper-parameters used for fine-tuning can be found in this YAML configuration file. However, we had to change a few parameters, as shown in the training script.
For your reference, here are the answers to your questions:
how many warmup_steps?
10,000 steps
how many hold_steps?
always 0
how many decay_steps?
20,000 steps
And how many freeze_finetune_updates did you set?
Non-English models used 4,000 steps out of 30,000 total steps, and the English model used 24,000 steps out of 90,000 total steps.
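To make these numbers concrete, here is a minimal sketch of how a warmup/hold/decay learning-rate schedule and an encoder-freeze window could interact. It assumes a tri-stage schedule (linear warmup, constant hold, exponential decay), which is what the parameter names suggest but is not confirmed in this thread. The peak learning rate and the `init_lr_scale` / `final_lr_scale` values are placeholders, and `tri_stage_lr` and `encoder_frozen` are hypothetical helper names, not functions from the repository.

```python
import math


def tri_stage_lr(step, peak_lr, warmup_steps, hold_steps, decay_steps,
                 init_lr_scale=0.01, final_lr_scale=0.05):
    """Learning rate at `step` under an assumed tri-stage schedule.

    The two *_scale defaults are illustrative placeholders, not values
    taken from the thread.
    """
    init_lr = init_lr_scale * peak_lr
    final_lr = final_lr_scale * peak_lr

    # Stage 1: linear warmup from init_lr to peak_lr.
    if step < warmup_steps:
        return init_lr + (peak_lr - init_lr) * step / warmup_steps
    step -= warmup_steps

    # Stage 2: hold at peak_lr (skipped entirely when hold_steps == 0).
    if step < hold_steps:
        return peak_lr
    step -= hold_steps

    # Stage 3: exponential decay from peak_lr towards final_lr.
    if step < decay_steps:
        decay_rate = -math.log(final_lr_scale) / decay_steps
        return peak_lr * math.exp(-decay_rate * step)

    # After all three stages, stay at final_lr.
    return final_lr


def encoder_frozen(step, freeze_finetune_updates):
    """True while the pretrained encoder is still frozen (assumed semantics)."""
    return step < freeze_finetune_updates


if __name__ == "__main__":
    PEAK_LR = 1e-3  # placeholder; the real value is in the YAML config
    for step in (0, 4_000, 10_000, 20_000, 30_000):
        lr = tri_stage_lr(step, PEAK_LR, warmup_steps=10_000,
                          hold_steps=0, decay_steps=20_000)
        print(step, f"{lr:.2e}", "frozen" if encoder_frozen(step, 4_000) else "trainable")
```

Note that 10,000 warmup steps plus 0 hold steps plus 20,000 decay steps add up to the 30,000 total steps quoted for the non-English models.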
The second question is about punctuation removal and lowercasing before calculating WER, because I also observed some special tokens in the dictionary, e.g. the music token ♪. Which tokens have you removed and how?
Yes, we used Fairseq's WerScorer, which removes punctuation and lowercases the text.
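For anyone who wants to reproduce this kind of scoring outside Fairseq, the following is a rough, self-contained sketch of WER with lowercasing and punctuation removal. It is not Fairseq's implementation: `normalize` and `wer` are hypothetical helpers, and treating "punctuation" as any character whose Unicode category starts with `P` is an assumption made for this sketch.

```python
import unicodedata


def normalize(text):
    """Lowercase and drop characters whose Unicode category starts with 'P'."""
    kept = (c for c in text if not unicodedata.category(c).startswith("P"))
    return " ".join("".join(kept).lower().split())


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance divided by reference length."""
    ref = normalize(reference).split()
    hyp = normalize(hypothesis).split()
    if not ref:
        raise ValueError("reference is empty after normalization")

    # Dynamic-programming Levenshtein distance over words, one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(normalize("Hello, World!"))
    print(wer("Hello, world!", "hello world"))
    print(wer("a b c", "a x c"))
```

In this sketch, a symbol such as ♪ (Unicode category `So`) is not treated as punctuation, so it would survive normalization and count as a token unless it is filtered separately.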
I hope I answered all of your questions. I'm going to close this for now, but feel free to re-open it when needed.
from muavic.