Comments (5)
@yjernite is this a duplicate of #174?
from data_tooling.
#self-assign
from data_tooling.
Done here https://huggingface.co/datasets/bigscience-catalogue-data/bookdash-books/tree/main, what's next?
from data_tooling.
Thanks @afaji.
Needs to be addressed, as it gives StopIteration error:
from datasets import load_dataset
ds = load_dataset("bigscience-catalogue-data/bookdash-books", split="train", streaming=True, use_auth_token=True)
item = next(iter(ds))
item
Using custom data configuration bigscience-catalogue-data--bookdash-books-cd95bc918e97c5c4
---------------------------------------------------------------------------
StopIteration Traceback (most recent call last)
from data_tooling.
Migrated the repo to bookdash_books
: https://huggingface.co/datasets/bigscience-catalogue-data/bookdash_books
I close this issue as duplicated with:
from data_tooling.
Related Issues (20)
- Create dataset xnli
- Create dataset indonesian_news_articles_2017 HOT 4
- Create dataset tsac
- Create dataset science_magazing_aaas_academic_journal_ HOT 1
- Create dataset ekantipur_com
- Create dataset nurition_fact
- Create dataset information_week_digital_magazine
- Create dataset du_reader HOT 4
- Create dataset wikihow_vietnamese_human_instructions HOT 2
- Create dataset MT_Vi_Mono_VLSP2020 HOT 4
- Create dataset malindomorph__morphological_dictionary_and_analyser_for_malay_indonesian
- Create dataset human_instructions_in_indonesian_extracted_from_wikihow
- Create dataset mind_body_green
- Create dataset vanguard_daily_media
- Create dataset opus_100 HOT 2
- Create dataset odiencorp2_0 HOT 4
- Create license-compliant version of the Pile: Stack Exchange HOT 1
- Create license-compliant version of the Pile: EuroParl HOT 1
- Citing this resource HOT 4
- Reason for not applying remove_non_prining_characters normalization HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from data_tooling.