fmow / dataset Goto Github PK
View Code? Open in Web Editor NEWLicense: Other
License: Other
I'm trying to download rgb-trainval. I can connect but the download speed remains at zero.
I've found two corrupted images in the dataset on AWS. The following MWE demonstrates the issues
aws s3 cp s3://spacenet-dataset/Hosted-Datasets/fmow/fmow-full/train/helipad/helipad_373/helipad_373_3_rgb.tif train-bad-helipad.tif
aws s3 cp s3://spacenet-dataset/Hosted-Datasets/fmow/fmow-full/val/helipad/helipad_107/helipad_107_3_rgb.tif val-bad-helipad.tif
gdalinfo val-bad-helipad.tif
gdalinfo train-bad-helipad.tif
Results in:
gdalinfo failed - unable to open 'val-bad-helipad.tif'.
gdalinfo failed - unable to open 'train-bad-helipad.tif'.
More detailed mwe:
python -c "from osgeo import gdal; gdal.UseExceptions(); gdal.Open('val-bad-helipad.tif')"
python -c "from osgeo import gdal; gdal.UseExceptions(); gdal.Open('train-bad-helipad.tif')"
Results in:
RuntimeError: val-bad-helipad.tif: MissingRequired:TIFF directory is missing required "StripOffsets" field
RuntimeError: train-bad-helipad.tif: MissingRequired:TIFF directory is missing required "StripOffsets" field
Is this dataset still available? I'm trying to download it with aws s3 sync s3://fmow-full . --request-payer requester
and I'm getting [Errno 13] Permission denied
Hello,
AFAIK, there are some fields in the metadata files that are not documented, such as the "raw_polygon"
field.
Is this supposed to be the polygon enclosing the image, with geospatial coordinates ?
What is the syntax of the value of this field ? Is it possible to convert it to a GeoJSON ?
Thanks
Could the md5 hash sums for torrents be provided?
Hey, both fMoW-rgb train and val
and fMoW-rgb test
torrents are down (not seeded), will they be ever available again? I was about to use them in my research. Thanks.
I'm trying to download the data on S3 right now. I have opened an account and installed the command line interface. But I don't get which command I need to download the rgb data.I tried something like:
aws s3 cp s3://spacenet-dataset/Hosted-Datasets/fmow/fmow-rgb ./
, but I get an error that the key Hosted-Dataset doesn't exist. I need to download and store the dataset in a specific file.
Thank you in advance.
Hi there,
I was wondering if GPS coordinate data for the train and validation sets are made available anywhere? Thank you!
Hello,
I just downloaded the AWS repository of rgb images (s3://spacenet-dataset/fmow/fmow-rgb/), but unfortunately it does not contain the ground truth data for test and seq folders. Is this information going to be made available soon? I also tried to download the torrent version (https://github.com/fMoW/dataset/raw/master/fMoW-rgb_seqandgt_v1.2.0.torrent), but it currently has no seeds.
Best regards,
Mauricio
/
/需要数据集可以联系我微信 web88518
/
As the image size varies for the same location in different timestamp, I wonder how to use the sequential images in tasks such as change detection where input image size should be the same.
Or, is there an anchor point that align the sequential images position.
There does not to be any seeds for fMoW-rgb_test_v1.0.0.torrent currently.
I have been trying to download the dataset using s3 command. But it says "Access Denied"
aws s3 cp s3://spacenet-dataset/Hosted-Datasets/fmow/fmow-rgb . --recursive
fatal error: An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied
The ground truth metadata json's denote the bounding box dimensions as 'box': [697, 1451, 5720, 2649]
, but not this repository nor the fMoW paper describes what these dimensions mean. You have to search the baseline code and find in this function that these represent [x, y, w, h]
where [x, y]
are the upper (?) left corner's indices. Other bounding box annotation schemes exist, e.g. see here, so I think this would be something nice to have clearly documented somewhere.
When I attempt to download fMoW dataset, I noticed that only torrent files are available on GitHub or AWS. However, upon trying to download the seed file, I found that no user owns any bucket, making it impossible for me to obtain the dataset corresponding to the seed. Could you please provide a more stable download method for us? Thank you very much for your assistance.
The paper mentions that the metadata includes ISO Alpha-3 country codes (Appendix I). However, the country_code
field in the metadata json's includes two codes that do not exist as ISO Alpha-3 code: KO-
and CA-
.
KO-
is not listed as any country code on the ISO website. KO
is listed as unassigned Alpha-2 code: link. All locations in the fMoW dataset with this country code are in Kosovo. Kosovo does not have its own ISO Alpha-3 code, the correct code is the one for Serbia: SRB
(just quoting the ISO website). There is an unofficial Alpha-2 code for Kosovo, as described here; XK
.CA-
, it does not exist as Alpha-3 code. CA
exists as Alpha-2 code for Canada (ISO website), but the fMoW metadata uses the Alpha-3 code CAN
to refer to Canada. The locations in the fMoW dataset with CA-
are around the Caspian Sea, both in southern Russia (Dagestan) and Azerbaijan. I don't know why the proper codes, like RUS
or AZE
wouldn't be used. Some of these locations are in Azerbaijan's capital, so doesn't seem like disputed territory.I'm posting this here not to get political but because I think this might be relevant to some people, and it is poorly documented by fMoW. E.g. I was trying to connect the fMoW countries to ISO 3 country codes in a shapefile to do operations with the locations. Please document this properly somewhere :-).
I'm trying to download rgb-trainval, but there are no seeds available and I do not have an AWS account.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.