Comments (4)
OK, it's all good now!
from muavic.
Hi @MrSupW,
Thanks for submitting this issue. It was caused by a simple change in the TED2020 dataset URL. Please run git pull to update your local repo with the latest changes.
Thanks!
The download_ted2020 function works now, but a new problem arises in the prepare_lrs3_avst_manifests function. Here is the traceback, @Anwarvic:
Extracting en-el: 100%|██████████| 262659/262659 [00:00<00:00, 814098.07it/s]
Extracting en-es: 100%|██████████| 405955/405955 [00:00<00:00, 1101689.57it/s]
Extracting en-fr: 100%|██████████| 399853/399853 [00:00<00:00, 958119.07it/s]
Extracting en-it: 100%|██████████| 363786/363786 [00:00<00:00, 944902.82it/s]
Extracting en-pt: 100%|██████████| 320286/320286 [00:00<00:00, 883210.78it/s]
Extracting en-ru: 100%|██████████| 380270/380270 [00:00<00:00, 675357.42it/s]
Creating AVST manifests
en-el AVST manifest: 0%| | 0/3 [00:01<?, ?it/s]
Traceback (most recent call last):
  File "get_data.py", line 107, in <module>
    main(args)
  File "get_data.py", line 73, in main
    prepare_lrs3(args)
  File "get_data.py", line 60, in prepare_lrs3
    prepare_lrs3_avst_manifests(args["mt_trans"], args["ted2020"], args["muavic"])
  File "/mnt/pfs/wanghe/corpus/muavic/muavic/lrs3_utils.py", line 401, in prepare_lrs3_avst_manifests
    split_ids = read_txt_file(
  File "/mnt/pfs/wanghe/corpus/muavic/muavic/utils.py", line 27, in read_txt_file
    with open(txt_filepath) as fin:
FileNotFoundError: [Errno 2] No such file or directory: '/mnt/pfs/wanghe/corpus/muavic/mt_trans/en-x/train_id.txt'
Hi @MrSupW,
Thanks for highlighting this issue! The code responsible for downloading '/mnt/pfs/wanghe/corpus/muavic/mt_trans/en-x.tgz' (the archive that contains the train_id.txt file missing in your traceback) was absent from the public repo. I've updated the repo with the missing code. Please run git pull to update your local copy to the latest version.
Thanks again for raising these issues!
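If the error persists after pulling, it may help to confirm that the archive was actually downloaded and extracted before the manifest step runs. The following is only a minimal sketch, not code from the muavic repo: the helper name find_missing_files is hypothetical, the "mt_trans" location is a placeholder for your own path, and en-x/train_id.txt is the only file name taken from the traceback above.

```python
from pathlib import Path


def find_missing_files(root, relative_paths):
    """Return the entries of relative_paths that are not regular files under root."""
    root = Path(root)
    return [rel for rel in relative_paths if not (root / rel).is_file()]


if __name__ == "__main__":
    # Placeholder location (assumption): point this at your own mt_trans directory.
    mt_trans_dir = Path("mt_trans")
    # Only en-x/train_id.txt is confirmed by the traceback; add other files as needed.
    required = ["en-x/train_id.txt"]
    missing = find_missing_files(mt_trans_dir, required)
    if missing:
        print("Missing files:", missing)
    else:
        print("All required files are present.")
```

A non-empty result suggests the en-x.tgz download or extraction did not complete, so rerunning the data preparation after git pull would be the next thing to try.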