Comments (2)
Hi, thank you for reporting!
This is definitely a bug.
Workaround: add the following arg to your tfds.load
call:
tfds.load(..., download_and_prepare_kwargs={'file_format': tfds.core.FileFormat.ARRAY_RECORD})
We'll look on how to update the code and update on the bug.
from datasets.
It's still giving error.
import tensorflow_datasets as `tfds`
plant_leaves_data, plant_leaves_info = tfds.load('plant_leaves', split='train', shuffle_files=True, download_and_prepare_kwargs={'file_format': tfds.core.FileFormat.ARRAY_RECORD})
Gives
Downloading and preparing dataset 6.56 GiB (download: 6.56 GiB, generated: 6.81 GiB, total: 13.37 GiB) to /root/tensorflow_datasets/plant_leaves/0.1.1...
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
[<ipython-input-3-608b46b22c6c>](https://localhost:8080/#) in <cell line: 4>()
2 #plant_leaves = tfds.load('plant_leaves', split='train', shuffle_files=True)
3 #plant_leaves_data, plant_leaves_info = tfds.load('plant_leaves', split='train', shuffle_files=True, as_data_source=True)
----> 4 plant_leaves_data, plant_leaves_info = tfds.load('plant_leaves', split='train', shuffle_files=True, download_and_prepare_kwargs={'file_format': tfds.core.FileFormat.ARRAY_RECORD})
5 frames
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/logging/__init__.py](https://localhost:8080/#) in __call__(self, function, instance, args, kwargs)
167 metadata = self._start_call()
168 try:
--> 169 return function(*args, **kwargs)
170 except Exception:
171 metadata.mark_error()
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/load.py](https://localhost:8080/#) in load(name, split, data_dir, batch_size, shuffle_files, download, as_supervised, decoders, read_config, with_info, builder_kwargs, download_and_prepare_kwargs, as_dataset_kwargs, try_gcs)
645 try_gcs,
646 )
--> 647 _download_and_prepare_builder(dbuilder, download, download_and_prepare_kwargs)
648
649 if as_dataset_kwargs is None:
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/load.py](https://localhost:8080/#) in _download_and_prepare_builder(dbuilder, download, download_and_prepare_kwargs)
504 if download:
505 download_and_prepare_kwargs = download_and_prepare_kwargs or {}
--> 506 dbuilder.download_and_prepare(**download_and_prepare_kwargs)
507
508
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/logging/__init__.py](https://localhost:8080/#) in __call__(self, function, instance, args, kwargs)
167 metadata = self._start_call()
168 try:
--> 169 return function(*args, **kwargs)
170 except Exception:
171 metadata.mark_error()
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/dataset_builder.py](https://localhost:8080/#) in download_and_prepare(self, download_dir, download_config, file_format)
679 # to generate the files.
680 if file_format:
--> 681 self.info.set_file_format(file_format, override=True)
682
683 # Create a tmp dir and rename to self.data_dir on successful exit.
[/usr/local/lib/python3.10/dist-packages/tensorflow_datasets/core/dataset_info.py](https://localhost:8080/#) in set_file_format(self, file_format, override)
470 )
471 if override and self._fully_initialized:
--> 472 raise RuntimeError(
473 "Cannot override the file format "
474 "when the DatasetInfo is already fully initialized!"
RuntimeError: Cannot override the file format when the DatasetInfo is already fully initialized!
from datasets.
Related Issues (20)
- [data request] <emnist>
- Error when processing speech_commands dataset HOT 1
- [data request] <poker>
- tfds failed to load open-x-embodiement dataset HOT 4
- Cannot build hugging face datasets HOT 1
- [data request] figshare brain tumor dataset HOT 3
- NonMatchingChecksumError while loading the dataset plant_leaves HOT 3
- Invalid UTF8 bytes in default TAGS.txt
- [data request] malaria HOT 2
- Installation issues HOT 1
- RecursionError using tfds.load to import tensorflow-dataset (Mac) HOT 2
- image classification HOT 1
- 'open_images_v4' with 'array_record' raise ValueError
- [Failed to get device properties, error code: 3 | tfds.numpy()] HOT 3
- Higgs Dataset - ValueError on download_and_prepare() HOT 1
- out of Memory HOT 2
- [data request] <dataset name> HOT 2
- etils.epy.lazy_imports not found HOT 5
- AttributeError: module 'tree' has no attribute 'map_structure'
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datasets.