Sample Data: Anscombe Regression Lines

Source Notebook

Anscombe's 4 regression line data

These four datasets that have nearly identical simple descriptive statistics, yet appear very different when graphed. Each dataset consists of eleven (x,y) points. They were constructed in 1973 by the statistician Francis Anscombe to demonstrate both the importance of graphing data before analyzing it and the effect of outliers on statistical properties.

(8 columns, 11 rows)

Examples

Basic Examples

Retrieve the ResourceObject:

In[1]:=
ResourceObject["Sample Data: Anscombe Regression Lines"]
Out[1]=

View the data:

In[2]:=
ResourceData["Sample Data: Anscombe Regression Lines"]
Out[2]=

Visualization

Plot the four datasets with linear regression line:

In[3]:=
Show[Plot[3 + 0.5 x, {x, 2, 16}], #] & /@ ListPlot /@ (ResourceData["Sample Data: Anscombe Regression Lines"][
      All, #] & /@ {{"X1", "Y1"}, {"X2", "Y2"}, {"X3", "Y3"}, {"X4", "Y4"}})
Out[3]=

Wolfram Research, "Sample Data: Anscombe Regression Lines" from the Wolfram Data Repository (2016)   https://doi.org/10.24097/wolfram.70071.data

Data Resource History

Source Metadata

Data Downloads

Publisher Information