Git Product home page Git Product logo

Comments (4)

domoritz avatar domoritz commented on September 18, 2024

The barley dataset comes from Becker: https://mbostock.github.io/protovis/ex/barley.html

from vega-datasets.

palewire avatar palewire commented on September 18, 2024

Forgive my ignorance, but why is it sourced to Becker? The version I see packaged for R sources Immer and Cleveland.

from vega-datasets.

palewire avatar palewire commented on September 18, 2024

Ah, is it because Becker is cited for making the trellis chart from the barley data? A Becker paper says:

The barley experiment was run in the 1930s. The data first appeared
in a 1934 report published by the experimenters. Since then, the data have
been analyzed and re-analyzed. R. A. Fisher presented the data for five of
the sites in his classic book, The Design of Experiments. Publication in the
book made the data famous, and many others subsequently analyzed them,
usually to illustrate a new statistical method.

Then in the early 1990s, the data were visualized by Trellis Graphics.
The result was a big surprise. Through 60 years and many analyses, an
important happening in the data had gone undetected.

from vega-datasets.

palewire avatar palewire commented on September 18, 2024

Here is another Becker paper I believe we can cite to write a solid source entry for the data, which he drew from elsewhere.

In the 1930s an experiment was run in the state of Minnesota. At six sites, ten
varieties of barley were grown in each of two years. The data collected for the experiment
are the yields for all combinations of site, variety, and year, so there are 6 x to x 2 = 120
observations. The experiment is of historical interest because it is one of the early field
trials that incorporated R. A. Fisher's ideas on randomization and the analysis of variance.
The agronomists published the data and an analysis of them in Immer, Hayes, and Powers
(1934). Fisher published the data in his classic book, The Design ofExperiments (Fisher 1971), but he did not present an analysis. Fisher's publication gave the data a large
exposure, and many others tried their hands at analyzing them to illustrate new statistical
methods (Anscombe 1981, 1983; Daniel 1976). We will do the same here, using the
data to illustrate Trellis display. The visualization using Trellis reveals an important
happening in the data-there appears to be a major error, one that survived undetected
for six decades (Cleveland 1993).

from vega-datasets.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.