Comments (3)
#5423
Updated dataset link: https://www.statmt.org/wmt16/index.html
from fairseq.
#5423 Updated dataset link: https://www.statmt.org/wmt16/index.html
Excuse me, the preprocessed dataset link in "Training a new model on WMT'16 En-De" is still broken.
Alternatively, could you explain how to preprocess the dataset downloaded from the link you provided?
Thank you very much.
The dataset link used in https://github.com/facebookresearch/fairseq/blob/main/examples/scaling_nmt/README.md is broken. Is there anywhere I can find the dataset?
Have you found another way to download the preprocessed data?