Comments (4)
The barley dataset comes from Becker: https://mbostock.github.io/protovis/ex/barley.html
from vega-datasets.
Forgive my ignorance, but why is it sourced to Becker? The version I see packaged for R sources Immer and Cleveland.
from vega-datasets.
Ah, is it because Becker is cited for making the trellis chart from the barley data? A Becker paper says:
The barley experiment was run in the 1930s. The data first appeared
in a 1934 report published by the experimenters. Since then, the data have
been analyzed and re-analyzed. R. A. Fisher presented the data for five of
the sites in his classic book, The Design of Experiments. Publication in the
book made the data famous, and many others subsequently analyzed them,
usually to illustrate a new statistical method.
Then in the early 1990s, the data were visualized by Trellis Graphics.
The result was a big surprise. Through 60 years and many analyses, an
important happening in the data had gone undetected.
from vega-datasets.
Here is another Becker paper I believe we can cite to write a solid source entry for the data, which he drew from elsewhere.
In the 1930s an experiment was run in the state of Minnesota. At six sites, ten
varieties of barley were grown in each of two years. The data collected for the experiment
are the yields for all combinations of site, variety, and year, so there are 6 x to x 2 = 120
observations. The experiment is of historical interest because it is one of the early field
trials that incorporated R. A. Fisher's ideas on randomization and the analysis of variance.
The agronomists published the data and an analysis of them in Immer, Hayes, and Powers
(1934). Fisher published the data in his classic book, The Design ofExperiments (Fisher 1971), but he did not present an analysis. Fisher's publication gave the data a large
exposure, and many others tried their hands at analyzing them to illustrate new statistical
methods (Anscombe 1981, 1983; Daniel 1976). We will do the same here, using the
data to illustrate Trellis display. The visualization using Trellis reveals an important
happening in the data-there appears to be a major error, one that survived undetected
for six decades (Cleveland 1993).
from vega-datasets.
Related Issues (20)
- Movies release dates off by 100 years HOT 2
- Add OHLC Data HOT 7
- add vega-datasets JS notebook to docs ...
- Update sf-temps HOT 1
- Clean up for 2.0 HOT 6
- Update to 2017 Census HOT 8
- Urls should point to stable source
- Add penguin data
- Birdstrikes dataset missing HOT 2
- Is their a license for this dataset? HOT 1
- Can not load earthquakes dataset HOT 1
- build/vega-datasets.min.js is now an iife, breaking require HOT 3
- Published build artifacts have the wrong version (2.5.0 instead of 2.5.1) HOT 5
- 7 datasets that cannot be loaded HOT 1
- movies.json Release Date is sometimes in the future HOT 3
- Unable to process .arrow file in the datasets HOT 3
- Address data inconsistencies and absence of versioning or sourcing in gapminder data HOT 6
- Unclear provenance of crimea.json dataset ('Nightingale's Rose') HOT 2
- Correct errors in monarchs.json or document them in SOURCES.md? HOT 1
- An in-range update of rollup is breaking the build 🚨 HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vega-datasets.