Wolfram Research

Beowulf - Modern English

Beowulf (Modern English Translation)

Japanese-English Subtitle Corpus

A parallel corpus for machine translation systems, information extraction and other language processing techniques

Japanese-English Legal Parallel Corpus

A parallel corpus for machine translation systems, information extraction and other language processing techniques

Beowulf - Old English

The oldest surviving long poem in Old English about Beowulf, a warrior, battling a monster known as a Grendel.

Don Quixote - English

The Adventures of Don Quixote, Volume One, by Miguel de Cervantes

Plato's Meno - English

A Socratic dialogue between Plato and Meno on the definition of human virtue.

Code of Hammurabi – English

Babylonian code of 282 laws that is the oldest surviving written law code and known for the punishment of "an eye for an eye, a tooth for a tooth."

Europarl English-Spanish Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Europarl English-German Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Europarl English-Italian Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Europarl English-French Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Prufrock

The Love Song of J. Alfred Prufrock, by T. S. Eliot

Magna Carta

Text of Magna Carta Libertatum (the Great Charter of the Liberties) which agreed to grant a fair trial and right to justice to all 'free men'

Shakespeare's Sonnets

A collection of 154 sonnets by William Shakespeare.

Alice in Wonderland

Alice's Adventures in Wonderland, by Lewis Carroll

The 20-Task bAbI Question-Answering Dataset v1.2

A dataset for question answering and text understanding in both Hindi and English

On the Origin of Species

On the Origin of Species By Means of Natural Selection, or, the Preservation of Favoured Races in the Struggle for Life, by Charles Darwin

Kyoto Free Translation Task Data

A parallel corpus for the evaluation and development of Japanese-English machine translation systems

Round the Moon

Plaintext in English for Jules Verne's "Round the Moon"

World Constitutions

Full text of national constitutions

Emojis

Table of emojis and their metadata

Sample Data: UCI Letter

Letter recognition dataset

Irish-Viking Networks in 'Cogadh Gaedhel re Gallaibh'

Graph datasets for Irish and Viking character relationships in the medieval Irish text 'Cogadh Gaedhel re Gallaibh' ('The War of the Gaedhil with the Gaill')

Video Games

Dataset from the Internet Games Database API as of June 2018

Sample Image: Girl on a Blue Background

Sample Image of a Girl

Sample Data: Pacific Walrus Haulouts

Congregations of Pacific walruses off the coast of U.S. and Russia, 1852-2016

Agnes Grey

Plaintext for Anne Brontë's "Agnes Grey"

Westward Ho!

Plaintext for Charles Kingsley's "Westward Ho!"

SQuAD v2.0 Tokens Generated with WL

A list of isolated words and symbols from the SQuAD dataset, which consists of a set of Wikipedia articles labeled for question answering and reading comprehension

NYC Emergency Response Incidents

NYC Open Data makes the wealth of public data generated by various New York City agencies and other City organizations available for public use. This catalog offers access to a repository of government-produced, machi...

Tarzan and the Jewels of Opar

Plaintext for Edgar Rice Burroughs' "Tarzan and the Jewels of Opar"

Top500 List of Supercomputers - June 2016

The TOP500 table shows the 500 most powerful commercially available computer systems known to us.

Nematode Pharynx Graph

Connectome of anterior section of hermaphrodite Caenorhabditis elegans pharynx

A Portrait of the Artist as a Young Man

Plaintext for James Joyce's "A Portrait of the Artist as a Young Man"

Theorem Network from Euclid's Elements

A graph of the theorems from Euclid's Elements and their logical dependencies

USDA Summer Food Service Programs

Information about the Summer Food Service Program (SFSP)

The Golden Bowl

Plaintext for Henry James' "The Golden Bowl"

The Beasts of Tarzan

Plaintext for Edgar Rice Burroughs' "The Beasts of Tarzan"

Human Cell Counts

BioNumbers

HUD Location Affordability Index

The Location Affordability Index gives estimates of the percentage of a family's income dedicated to the combined cost of housing and transportion in a given location.

Railway Children

Plaintext for Edith Nesbit's "Railway Children"

National Science Foundation Grants - 2015

Data on National Science Foundation grants and associated investigators and institutions awarded in the year 2015

Nuclear Latency Dataset

Facility-specific information on sensitive nuclear plants constructed from 1939 to 2012

The Ambassadors

Plaintext for Henry James' "The Ambassadors"

The Last Man

Plaintext for Mary Shelley's "The Last Man"

Adam Bede

Plaintext for George Eliot's "Adam Bede"

Meteorite Landings

This comprehensive data set from The Meteoritical Society contains information on all of the known meteorite landings.

Rudin

Plaintext for Ivan Turgenev's "Rudin"

NYC Jobs, Aug 2015 - Aug 2016

Information about NYC jobs for Aug. 2015 - Aug. 2016

Sample Data: Plant Growth

Dried weight of plants was used to compare their yields for each of one control group and two treatment groups.

JFK Inaugural Address

The text of the inaugural address of President John F. Kennedy, which is known for "ask not what your country can do for you--ask what you can do for your country."

Notes From Underground

Plaintext for Fyodor Dostoevsky's "Notes from Underground"

The Education of Henry Adams

Plaintext for Henry Adams's "The Education of Henry Adams"

General Topology EntityStore

Textbook

The Descent of Man, and Selection in Relation to Sex

Plaintext for Charles Darwin's "The Descent of Man"

The Voyage Out

Plaintext for Virginia Woolf's "The Voyage Out"

Wonder-Book for Girls and Boys

Plaintext for Nathaniel Hawthorne's "Wonder-Book for Girls and Boys"

Tarzan the Terrible

Plaintext for Edgar Rice Burroughs' "Tarzan the Terrible"

Tik-Tok of Oz

Plaintext for L. Frank Baum's "Tik­Tok of Oz"

Hospital Medicare Spending by Claim 2014

Medicare spending per beneficiary (MSPB) by insurance claim type by hospital along with averages and percentage comparisons.

Public Housing Developments 2015

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...

FashionMNIST

A small MNIST-like fashion product image dataset

Sample Data: Indomethicin Pharmacokinetics

Time series data about concentration of indomethicin on 6 subjects in the indomethicin pharmacokinetics experiment.

The Tin Woodman of Oz

Plaintext for L. Frank Baum's "The Tin Woodman of Oz"

The Last of the Mohicans

Plaintext for James Fenimore Cooper's "The Last of the Mohicans"

Sample Data: Anorexia Treatment

Anorexia data on weight change for young female patients.

The Arrow of Gold

Plaintext for Joseph Conrad's "The Arrow of Gold"

The Wheels of Chance

Plaintext for H. G. Wells' "The Wheels of Chance"

The Emerald City of Oz

Plaintext for L. Frank Baum's "The Emerald City of Oz"

Sample Data: 2003 US Life Table

A life table for the total United States population in 2003

The Spoilers

Plaintext for Rex Beach's "The Spoilers"

Peter And Wendy

Plaintext for J. M. Barrie's "Peter and Wendy"

Mr. Britling Sees It Through

Plaintext for H. G. Wells' "Mr. Britling Sees It Through"

The Big Time

Plaintext for Fritz Leiber's "The Big Time"

The Old Wives' Tale

Plaintext for Arnold Bennett's "The Old Wives' Tale"

A Voyage to Arcturus

Plaintext for David Lindsay's "A Voyage to Arcturus"

Pellucidar

Plaintext for Edgar Rice Burroughs' "Pellucidar"

Sample Data: Water Boiling Points in the Alps

17 observations on the boiling point of water and barometric pressure in inches of mercury.

The Pickwick Papers

Plaintext for Charles Dickens' "The Pickwick Papers"

The Secret Sharer

Plaintext for Joseph Conrad's "The Secret Sharer"

The Scarecrow of Oz

Plaintext for L. Frank Baum's "The Scarecrow of Oz"

The Lost Princess of Oz

Plaintext for L. Frank Baum's "The Lost Princess of Oz"

Near-Earth Comets

J2000 heliocentric ecliptic orbital elements of 160 Near-Earth Comets

Mansfield Park

Plaintext for Jane Austen's "Mansfield Park"

The Old Curiosity Shop

Plaintext for Charles Dickens' "The Old Curiosity Shop"

The Gods of Mars

Plaintext for Edgar Rice Burroughs' "The Gods of Mars"

United States Hail Storms 1955-2015

Information on severe hail storms in the US, including injury and property loss data

Sample Data: Guinea Pig Tooth Growth

Data on the length of odontoblasts (teeth) for 10 guinea pigs measured at each of three dose levels of Vitamin C with each of two delivery methods.

Dombey and Son

Plaintext for Charles Dickens' "Dombey and Son"

The Jungle

Plaintext for Upton Sinclair's "The Jungle"

The People That Time Forgot

Plaintext for Edgar Rice Burroughs' "The People that Time Forgot"

The Son of Tarzan

Plaintext for Edgar Rice Burroughs' "The Son of Tarzan"

U.S. Baby Names By State

A comprehensive list of frequencies of baby names in the U.S, listed by year and state, since 1910

Through the Looking Glass

Plaintext for Lewis Carroll's "Through the Looking Glass"

The House of Mirth

Plaintext for Edith Wharton's "The House of Mirth"

Sample Data: Puromycin Reaction Velocity

Velocities of enzymatic reactions with or without Puromycin.

The Head of the House of Coombe

Plaintext for Frances Hodgson Burnett's "The Head of the House of Coombe"

To the Last Man

Plaintext for Zane Grey's "To the Last Man"

The Invisible Man

Plaintext for H. G. Wells' "The Invisible Man"

Villette

Plaintext for Charlotte Brontë's "Villette"

The Woman in White

Plaintext for Wilkie Collins' "The Woman in White"

The Scarlet Pimpernel

Plaintext for Baroness Orczy's "The Scarlet Pimpernel"

Tornadoes in the U.S., 1950-2015

Tornadoes tracked by NOAA from 1950-2015

Vanity Fair

Plaintext for William Makepeace Thackeray's "Vanity Fair"

Crime and Punishment

Plaintext for Fyodor Dostoevsky's "Crime and Punishment"

Jacob's Room

Plaintext for Virginia Woolf's "Jacob's Room"

Leaves of Grass

Plaintext for Walt Whitman's "Leaves of Grass"

Hadley Center Central England Temperature (HadCET) Dataset

The CET dataset is the longest instrumental record of temperature in the world

Mass Shootings in America

Curated data on mass shootings in America from 1966 to 2016.

The Country of the Blind

Plaintext for H. G. Wells' "The Country of the Blind"

Global Air Navigation Aids

This site was created by David Megginson, a private pilot and frequent airline passenger. This site also takes advantage of many other people's work

The Poison Belt

Plaintext for Arthur Conan Doyle's "The Poison Belt"

Dracula

Plaintext for Bram Stoker's "Dracula"

The Status Civilization

Plaintext for Robert Sheckley's "The Status Civilization"

Community Development Block Grant Activity by Tract

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...

Sample Data: Stackloss Plant

Brownlee's stack loss plant data.

An Apology for the Life of Mrs. Shamela Adams

Plaintext for Henry Fielding's "An Apology for the Life of Mrs. Shamela Adams"

Organ Transplants by Country

Organ Donation and Transplantation Activities

The Devil's Dictionary

Plaintext for Ambrose Bierce's "The Devil's Dictionary"

US Fatal Injuries 1999-2014

Dataset of deaths and crude rates of fatal injuries in the United States from 1999 to 2014

New York City Elevators

List of registered elevator devices in New York City

Ozma of Oz

Plaintext for L. Frank Baum's "Ozma of Oz"

The Island of Dr. Moreau

Plaintext for H. G. Wells' "The Island of Dr. Moreau"

Rilla of Ingleside

Plaintext for Lucy Maud Montgomery's "Rilla of Ingleside"

UFO Sightings 2015

Dataset of UFO sightings in the United States in 2015

The Good Soldier: A Tale of Passion

Plaintext for Ford Madox Ford's "The Good Soldier: A Tale Of Passion"

The Federalist Papers

Plaintext for Alexander Hamilton, James Madison, and John Jay's "The Federalist Papers"

Child Mortality from Malaria

Child mortality numbers caused by malaria by country.

Sample Data: Movie Review Sentence Polarity

Movie review data

The Awakening

Plaintext for Kate Chopin's "The Awakening"

Mary Barton

Plaintext for Elizabeth Cleghorn Gaskell's "Mary Barton"

Sample Data: Ozone Concentrations

Daily maximum ozone concentrations at Stamford, Connecticut and Yonkers, New York, during the period May 1, 1974 to September 30, 1974, recorded in parts per billion (ppb).

Spoken Digit Commands

A dataset consisting of recordings of spoken digits

The Mad King

Plaintext for Edgar Rice Burroughs' "The Mad King"

Moll Flanders

Plaintext for Daniel Defoe's "Moll Flanders"

Kidnapped

Plaintext for Robert Louis Stevenson's "Kidnapped"

The Trial

Plaintext for Franz Kafka's "The Trial"

Stack Overflow Survey 2016

Results from Stack Overflow's 2016 Developer Survey

Prussian Horse Kick Data

Number of soldiers in the Prussian cavalry killed by horse kicks

The Magic of Oz

Plaintext for L. Frank Baum's "The Magic of Oz"

The Moonstone

Plaintext for Wilkie Collins' "The Moonstone"

Uncle Tom's Cabin; or, Life Among The Lowly

Plaintext for Harriet Beecher Stowe's "Uncle Tom's Cabin"

Sample Data: Wolf Sunspot Numbers

Data for yearly averages of sunspot numbers originaly compiled by Johann Rudolf Wolf (1770-1869).

The Adventures of Tom Sawyer

Plaintext for Mark Twain's "The Adventures of Tom Sawyer"

Sample Data: Psychiatric Patient Deaths

Death times of psychiatric patients

GMM-3 Mars Gravity Map

Goddard Mars Model 3 map of the gravity field of Mars

FAA Wildlife Strikes

All reports of birds and other wildlife striking aircraft in the U.S. since 1990

A Little Princess

Plaintext for Frances Hodgson Burnett's "A Little Princess"

The History of Mr. Polly

Plaintext for H. G. Wells' "The History of Mr. Polly"

Commuter-Adjusted Daytime Population by U.S. County

2006-2010 data on daytime commuting patterns in the U.S.

Sample Data: University Salaries

Salaries from three university campuses

The Mysteries of Udolpho

Plaintext for Ann Radcliffe's "The Mysteries of Udolpho"

The Idiot

Plaintext for Fyodor Dostoevsky's "The Idiot"

Sample Data: Soporific Drugs

Data showing the effect of two soporific drugs on 10 patients regarding to the increase in hours of sleep compared to control.

Sample Data: Boston Homes

Home values for 506 Boston suburbs with potential influential factors.

Free Code Camp New Coder Survey 2016

Results from Free Code Camp's 2016 survey of new coders

Washington, D.C. Metro Bus Stops

The District provides a large quantity of government information available to the public. The Open Data Catalog provides hundreds of District government datasets, available as raw downloads in a variety of formats, an...

Federalist No. 10

Text of Federalist Paper No. 10 or "The Same Subject Continued: The Union as a Safeguard Against Domestic Faction and Insurrection."

Atlantic Hurricane Data 1851-2017

A modification of the NOAA "Hurdat2" Dataset on Atlantic Hurricanes to facilitate use with the Wolfram Language

Genesis – King James Version

The Book of Genesis (King James Version)

The White Company

Plaintext for Arthur Conan Doyle's "The White Company"

Philoctetes

Plaintext for Sophocles' "Philoctetes"

The Wonderful Wizard of Oz

Plaintext for Frank Baum's "The Wonderful Wizard of Oz"

The Outlaw of Torn

Plaintext for Edgar Rice Burroughs' "The Outlaw of Torn"

VEHICLe

virtual exploratory heterocyclic library

Sample Data: Rainfall Seeding

Rainfall in acre-feet from 52 clouds, of which 26 were chosen randomly and seeded with silver oxide.

Sample Data: Time to AIDS Induction

Time to AIDS in years for adults and children from 1978

Sample Data: Cement Heat Evolution

Heat evolved by setting cements

Penrod and Sam

Plaintext for Booth Tarkington's "Penrod and Sam"

The Return of Tarzan

Plaintext for Edgar Rice Burroughs' "The Return of Tarzan"

Head Start Locations

Full list of currently active Head Start Program locations

The Red Badge of Courage

Plaintext for Stephen Crane's "The Red Badge of Courage"

Tom Brown's Schooldays

Plaintext for Thomas Hughes' "Tom Brown's Schooldays"

The Waste Land

Plaintext for T. S. Eliot's "The Waste Land"

Dorothy and the Wizard in Oz

Plaintext for L. Frank Baum's "Dorothy and the Wizard in Oz"

California Crop Mapping

Dataset of agricultural land use and irrigated acres in California

The Life and Adventures of Nicholas Nickleby

Plaintext for Charles Dickens' "The Life and Adventures of Nicholas Nickleby"

Wives and Daughters

Plaintext for Elizabeth Cleghorn Gaskell's "Wives and Daughters"

US High School Dropouts by Sex and Race

Rates of students who dropped out of high school, over time, by race/ethnicity and gender

Sample Data: Employee Attitude Survey

Employee attitude data for 30 departments in a large financial organization

The First Men in the Moon

Plaintext for H.G. Wells' "The First Men in the Moon"

Lord Jim

Plaintext for Joseph Conrad's "Lord Jim"

War and Peace

Plaintext for Leo Tolstoy's "War and Peace"

Sample Data: Formaldehyde Statistics

Formaldehyde concentration analysis obtained in a chemical experiment.

The American

Plaintext for Henry James' "The American"

Jungle Tales of Tarzan

Plaintext for Edgar Rice Burroughs' "Jungle Tales of Tarzan"

The World Set Free

Plaintext for H. G. Wells' "The World Set Free"

Sample Data: Singer Heights

Heights in inches of the singers in the New York Choral Society in 1979 grouped by their voice parts.

Rat Brain Graph 1

Rattus norvegicus brain network graph

Sample Image: Giraffe Graphic

Sample CMYK Image of a Giraffe

Sample Data: Kidney Transplant

Time to death of 863 kidney transplant patients.

Sample Data: Speed of Light

Michaelson and Morley's speed of light data

Sample Data: Life Cycle Savings

Savings data averaged over the decade from 1960 to 1970 to remove the business cycle or other short term fluctuations.

Nostromo

Plaintext for Joseph Conrad's "Nostromo"

The Sheik

Plaintext for Edith Maude Hull's "The Sheik"

A Far Country

Plaintext for Winston Churchill's "A Far Country"

Green Mansions: A Romance of the Tropical Forest

Plaintext for W. H. Hudson's "Green Mansions"

The Moon and Sixpence

Plaintext for W Somerset Maugham's "TThe Moon and Sixpence"

Food Access Research Atlas

The USDA's comprehensive report on food accessibility in America

FDIC Institution EntityStore

A Wolfram Language EntityStore with selected data on FDIC insured institutions

The Chessmen of Mars

Plaintext for Edgar Rice Burroughs' "The Chessmen of Mars"

The Virginian

Plaintext for Owen Wister's "The Virginian"

Out of Time's Abyss

Plaintext for Edgar Rice Burroughs' "Out of Time's Abyss"

The Way of All Flesh

Plaintext for Samuel Butler's "The Way of All Flesh"

The Voyage of the Beagle

Plaintext for Charles Darwin's "The Voyage of the Beagle"

Historical US Legislators

Members of the United States Congress, 1789-present

The Adventures of Huckleberry Finn

Plaintext for Mark Twain's "The Adventures of Huckleberry Finn"

SQuAD v2.0

A dataset for question answering and reading comprehension from a set of Wikipedia articles

NASA GISTEMP Global Means dTs

Global-mean monthly, seasonal, and annual temperatures from dTs since 1880.

Sample Data: 1993 US Cars

Data from 93 cars, selected at random, on sale in the US in 1993 with 27 variables

The Iron Woman

Plaintext for Margaret Deland's "The Iron Woman"

Fog Frequency by WMO Station

Average annual and monthly number of days with fog for 22 nations

Starman's Quest

Plaintext for Robert Silverberg's "Starman's Quest"

Sample Data: Warp Breaks

Number of breaks in yarn during weaving per loom.

Sample Data: Fisher's Cats

The heart and body weights of samples of male and female cats used for digitalis experiments. The cats were all adult, over 2 kg body weight.

More New Arabian Nights: The Dynamiter

Plaintext for Robert Louis Stevenson and Fanny Vandegrift's "More New Arabian Nights: The Dynamiter"

City of Champaign Street Signs

The various street signs of Champaign, IL. Data on signs include ownership, size, type, etc.

Road Traffic Fatalities by Type 2013

Road traffic deaths by type of road user and country from 2013.

White Fang

Plaintext for Jack London's "White Fang"

Tarzan of the Apes

Plaintext for Edgar Rice Burroughs' "Tarzan of the Apes"

Sample Data: Old Faithful Eruptions

Applied Statistics is a journal of international repute for statisticians both inside and outside the academic world.

Rebecca of Sunnybrook Farm

Plaintext for Kate Douglas Wiggin's "Rebecca Of Sunnybrook Farm"

Wind Storms in the U.S., 1955-2015

Wind Storms tracked by NOAA from 1955-2015

Beyond Lies the Wub

Plaintext for Philip K. Dick's "Beyond Lies the Wub"

The Warlord of Mars

Plaintext for Edgar Rice Burroughs' "The Warlord of Mars"

Anne of Avonlea

Plaintext for Lucy Maud Montgomery's "Anne of Avonlea"

Political Party Platforms

U.S. political party platforms through 2012

Equity in U.S. GED Programs

A report on the equity of school districts and their GED programs across the U.S. during the 2013-14 school year

The Vicar of Wakefield

Plaintext for Oliver Goldsmith's "The Vicar of Wakefield"

Main Street

Plaintext for Sinclair Lewis' "Main Street"

The Prince

Plaintext for Niccolò Machiavelli's "The Prince"

A Connecticut Yankee in King Arthur's Court

Plaintext for Mark Twain's "A Connecticut Yankee in King Arthur's Court"

U.S. Farmers Markets

A comprehensive directory of U.S. farmers markets

The Haunted Man and the Ghost's Bargain: A Fancy for Christmas­Time

Plaintext for Charles Dickens's "The Haunted Man and the Ghost's Bargain: A Fancy for Christmas Time"

United States Supreme Court Decisions 1946-present

Datasets relating to Supreme Court cases from 1946 to present

Spacecraft Materials Outgassing Data

Data was obtained at the Goddard Space Flight Center (GSFC), utilizing equipment developed at Stanford Research Institute (SRI) under contract to the Jet Propulsion Laboratory (JPL).

The Sea­Hawk

Plaintext for Rafael Sabatini's "The Sea-Hawk"

NASA GISTEMP Global Means

Global-mean monthly, seasonal, and annual temperatures since 1880.

Gettysburg Address

The Gettysburg Address, by Abraham Lincoln

Sample Data: Swiss Fertility

Swiss fertility and socioeconomic indicators (1888)

O Pioneers!

Plaintext for Willa Cather's "O Pioneers!"

Paradise Regained

Plaintext for John Milton's "Paradise Regained"

The Life and Opinions of Tristram Shandy, Gentleman

Plaintext for Laurence Sterne's "The Life and Opinions of Tristram Shandy, Gentleman"

Sample Data: Swiss Bank Notes

Six measurements made on 100 genuine Swiss bank notes and 100 counterfeit ones.

A Room with a View

Plaintext for E. M. Forster's "A Room with a View"

Sample Data: Anscombe Regression Lines

Anscombe's 4 regression line data

Dr. Jekyll and Mr. Hyde

Plaintext for Robert Louis Stevenson's "Dr. Jekyll and Mr. Hyde"

Tess of the d'Urbervilles

Plaintext for Thomas Hardy's "Tess of the d'Urbervilles"

Sample Data: DNase Assay

Data about the ELISA assay for the recombinant protein DNase in rat serum.

Sample Data: Channing House

The use of counting process methodology has allowed for substantial advances in the statistical theory to account for censoring and truncation in survival experiments. This book makes these complex methods more access...

Godfrey Morgan

Plaintext for Jules Verne's "Godfrey Morgan"

Silas Marner: The Weaver of Raveloe

Plaintext for George Eliot's "Silas Marner"

US Public Housing Authorities 2016

2016 HUD public housing authority data

Oedipus the King

Plaintext for Sophocles' "Oedipus the King"

Unleavened Bread

Plaintext for Robert Grant's "Unleavened Bread"

Just So Stories

Plaintext for Rudyard Kipling's "Just So Stories"

Love and Mr. Lewisham

Plaintext for H. G. Wells' "Love and Mr. Lewisham"

Sample Data: Chicken Weight

Biometrika is primarily a journal of statistics in which emphasis is placed on papers containing original theoretical contributions of direct or potential value in applications.

Christine

Plaintext for Alice Cholmondeley's "Christine"

The Metamorphosis

Plaintext for Franz Kafka's "The Metamorphosis"

The Chimes

Plaintext for Charles Dickens' "The Chimes"

Sample Audio: Apollo 11 One Small Step

Sample recording of Neil Armstrong's first words from the surface of the moon

Pygmalion

Plaintext for George Bernard Shaw's "Pygmalion"

The Lost World

Plaintext for Arthur Conan Doyle's "The Lost World"

Sample Data: Beaver Body Temperatures

Body temperature series for a female beaver

A Princess of Mars

Plaintext for Edgar Rice Burroughs' "A Princess of Mars"

Sample Data: Buffalo Snow

Snow accumulations in Buffalo

North and South

Plaintext for Elizabeth Cleghorn Gaskell's "North and South"

Tono­Bungay

Plaintext for H. G. Wells' "Tono­Bungay"

Sample Data: Mark Twain Authorship

Distribution of word lengths for Mark Twain and Quintus Curtius Snodgrass.

The Mucker

Plaintext for Edgar Rice Burroughs' "The Mucker"

CFPB Consumer Complaint Database 2013-14

Complaints about financial products and services to companies for response

Repetition Periods for Elementary Cellular Automata

A collection of rules and their repetition periods as a function of size

Sample Data: Lymphoma Marrow Transplants

Bone marrow transplants for Hodgkin's and non-Hodgkin's lymphoma patients

The Adventures of Sherlock Holmes

Plaintext for Arthur Conan Doyle's "The Adventures of Sherlock Holmes"

When Knighthood Was in Flower

Plaintext for Charles Major's "When Knighthood Was in Flower"

Sample Data: Birth Weight Risk

9 potential risk factors for low birth weight with birth weight outcomes.

USDA Rural Housing Active Projects

Data on the USDA's rental properties and labor housing types

Sample Data: Earthquake Waiting Times

The time in days between serious (magnitude at least 7.5 or over 1000 fatalities) earthquakes worldwide, from 12/16/1902 to 3/4/1977

Washington, D.C. Metro Stations

The District provides a large quantity of government information available to the public. The Open Data Catalog provides hundreds of District government datasets, available as raw downloads in a variety of formats, an...

SNAP Retailers

A comprehensive list of all retailers accepting SNAP payments in the U.S.

Barchester Towers

Plaintext for Anthony Trollope's "Barchester Towers"

Sample Data: Mercury Vapor Pressure

Relation of mercury vapor pressure vs temperature.

The Cricket on the Hearth: A Fairy Tale of Home

Plaintext for Charles Dickens' "The Cricket on the Hearth: A Fairy Tale of Home"

Ulysses

Plaintext for James Joyce's "Ulysses"

Fireballs and Bolides

Data on several of the brightest fireballs and bolides that were detected from 2009-2015 by U.S. Government sensors

Anne of Green Gables

Plaintext for Lucy Maud Montgomery's "Anne of Green Gables"

The Wind in the Willows

Plaintext for Kenneth Grahame's "The Wind in the Willows"

The Phantom of the Opera

Plaintext for Gaston Leroux's "The Phantom of the Opera"

USDA Aggregate Tenant Data on Active Properties

Demographics and information on the USDA's active tenant properties

The Virginians

Plaintext for William Makepeace Thackeray's "The Virginians"

The Hand of Ethelberta

Plaintext for Thomas Hardy's "The Hand of Ethelberta"

The Secret Garden

Plaintext for Frances Hodgson Burnett's "The Secret Garden"

Sample Data: Ceramic Strength

Effect of machining factors on the strength of ceramics

CFPB Consumer Complaint Database 2014-15

Complaints about financial products and services to companies for response

Paul Revere's Social Network in Colonial Boston

Dataset of associations among political groups in colonial Boston 1762 - 1775

The Royal Book of Oz

Plaintext for Ruth Plumly Thompson's "The Royal Book of Oz"

Sample Image: Happy Family in Red

Sample Image of a Happy Family

At the Back of the North Wind

Plaintext for George MacDonald's "At the Back of the North Wind"

Thuvia, Maid of Mars

Plaintext for Edgar Rice Burroughs' "Thuvia, Maid of Mars"

Moonfleet

Plaintext for J. Meade Falkner's "Moonfleet"

The Time Machine

Plaintext for H. G. Wells' "The Time Machine"

Sample Data: Esophageal Cancer

Case­control study of esophageal cancer in Ile­et­Vilaine

Sense and Sensibility

Plaintext for Jane Austen's "Sense and Sensibility"

Quo Vadis: A Narrative of the Time of Nero

Plaintext for Henryk Sienkiewicz's "Quo Vadis"

The War in the Air

Plaintext for H. G. Wells' "The War in the Air"

Child Mortality Rates by Prematurity

Child mortality rates caused by premature or preterm birth by country.

The Thirty­Nine Steps

Plaintext for John Buchan's "The Thirty­Nine Steps"

Sample Data: CPU Performance

A relative performance measure and characteristics of 209 CPUs.

Commuter-Adjusted Daytime Population by U.S. Place

2006-2010 data on daytime commuting patterns in the U.S.

The Wild Ass's Skin

Plaintext for Honoré de Balzac's "The Wild Ass's Skin"

Presidential Inaugural Addresses

The American Presidency Project (americanpresidency.org), was established in 1999 as a collaboration between John T. Woolley & Gerhard Peters at the University of California, Santa Barbara.

The Wrong Box

Plaintext for Lloyd Osbourne's "The Wrong Box"

BMI by Country

Global Health Observatory data repository

The Jungle Book

Plaintext for Rudyard Kipling's "The Jungle Book"

Sample Data: Otitis Media

The effects of a drug on 50 children with a history of otitis media in the Northern Territory of Australia.

Sample Data: Car Stopping Distances

Data on the relation between the speed of the car and the distance for the car to stop.

Northanger Abbey

Plaintext for Jane Austen's "Northanger Abbey"

Sample Data: Black Cherry Trees

Girth, height and volume of black cherry trees

Cranford

Plaintext for Elizabeth Cleghorn Gaskell's "Cranford"

The Return of Sherlock Holmes

Plaintext for Arthur Conan Doyle's "The Return of Sherlock Holmes"

State of the Union Addresses

Corpus of all the State of the Union addresses from 1790 to 2019.

Michel Strogoff

Plaintext for Jules Verne's "Michel Strogoff"

Little Lord Fauntleroy

Plaintext for Frances Hodgson Burnett's "Little Lord Fauntleroy"

Low Income Housing Tax Credit Properties

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...

Sample Data: Female Heights And Weights

Data on the average heights and weights for American women aged between 30 to 39.

An Inquiry into the Nature and Causes of the Wealth of Nations

Plaintext for Adam Smith's "The Wealth of Nations"

Rhesus Cerebral Cortex Graph 1

Connectome of rhesus macaque cerebral cortex

The Awakening of Helena Ritchie

Plaintext for Margaret Deland's "The Awakening of Helena Ritchie"

Sample Image: White Dog on a Beach

Sample Image of a Dog at the Beach

The Story of Mankind

Plaintext for Hendrik Willem van Loon's "The Story of Mankind"

The Helmet of Navarre

Plaintext for Bertha Runkle's "The Helmet of Navarre"

The Hound of the Baskervilles

Plaintext for Arthur Conan Doyle's "The Hound Of The Baskervilles"

Mr Standfast

Plaintext for John Buchan's "Mr. Standfast"

Typee: A Peep at Polynesian Life

Plaintext for Herman Melville's "Typee: A Peep at Polynesian Life"

SQuAD v1.1

A dataset for question answering and reading comprehension from a set of Wikipedia articles

Sample Data: Larynx Cancer

The use of counting process methodology has allowed for substantial advances in the statistical theory to account for censoring and truncation in survival experiments. This book makes these complex methods more access...

The Land That Time Forgot

Plaintext for Edgar Rice Burroughs' "The Land That Time Forgot"

The Food of the Gods

Plaintext for H. G. Wells' "The Food of the Gods"

Presidential Nomination Acceptance Speeches

Presidential nomination acceptance speeches

The Way We Live Now

Plaintext for Anthony Trollope's "The Way We Live Now"

Sample Image: Orange Butterfly on a Purple Flower

Sample Image of a Butterfly

Sample Data: Fisher's Irises

Fisher's iris data

Sample Data: Gilgai Soil

Line transect of soil in Gilgai territory.

Pride and Prejudice

Plaintext for Jane Austen's "Pride and Prejudice"

Sample Data: US Arrests

Number of arrests per 100,000 residents for assault, murder and rape in each of 50 U.S. states in 1973 and percentage of population living in urban areas for each state

The Scarlet Letter

Plaintext for Nathaniel Hawthorne's "The Scarlet Letter"

The Memoirs of Sherlock Holmes

Plaintext for Arthur Conan Doyle's "The Memoirs Of Sherlock Holmes"

The Golden Ass

Plaintext for Apuleius' "The Golden Ass"

Sample Data: Crab Measures

Five morphological measurements of two varieties of both sexes of crab species Leptograpsus variegatus from Fremantle, W. Australia.

Our Mutual Friend

Plaintext for Charles Dickens' "Our Mutual Friend"

Black Indies

Plaintext for Jules Verne's "Black Indies"

HCAHPS Patient Care Survey

Responses from a standardized survey on the quality of American hospital care

Narrative of the Life of Frederick Douglass

Plaintext for Frederick Douglass' "Narrative of the Life of Frederick Douglass"

The Home and the World

Plaintext for Rabindranath Tagore's "The Home and the World"

Sample Data: Australian AIDS

Data on patients diagnosed with AIDS in Australia before July 1, 1991

Micah Clarke

Plaintext for Arthur Conan Doyle's "Micah Clarke"

The Trail of the Lonesome Pine

Plaintext for John Fox, Jr.'s "The Trail of the Lonesome Pine"

Dear Enemy

Plaintext for Jean Webster's "Dear Enemy"

The Clouds

Plaintext for Aristophanes' "The Clouds"

The Importance of Being Earnest

Plaintext for Oscar Wilde's "The Importance of Being Earnest"

NYC Motor Vehicle Collisions

Motor vehicles collisions between road users reported to the NYPD.

Robinson Crusoe

Plaintext for Daniel Defoe's "Robinson Crusoe"

The Jewels of Aptor

Plaintext for Samuel R. Delany's "The Jewels of Aptor"

This Side of Paradise

Plaintext for F. Scott Fitzgerald's "This Side of Paradise"

Sample Data: Scottish Hill Races

Record times in Scottish hill races

Supreme Court Justice Database

A database on individual U.S. Supreme Court Justices and various variables including personalities and service on the Court's bench.

Little Fuzzy

Plaintext for H. Beam Piper's "Little Fuzzy"

The Sorrows of Young Werther

Plaintext for Johann Wolfgang von Goethe's "The Sorrows of Young Werther"

Infectious Diseases by Country 2009-2014

Number of reported, suspected and reported, and/or newly reported cases to the World Health Organization of selected contagious or infectious diseases like mumps and rubella by country from 2009 to 2014.

A Hero of Our Time

Plaintext for Mikhail Lermontov's "A Hero of Our Time"

The Napoleon of Notting Hill

Plaintext for G. K. Chesterton's "The Napoleon of Notting Hill"

A Study in Scarlet

Plaintext for Arthur Conan Doyle's "A Study in Scarlet"

Sample Data: Cushing's Syndrome Testing

Diagnostic tests on patients with Cushing's syndrome

Natural Amenities by U.S. County

A 1999 study measuring desirable natural characteristics in U.S. counties

Moby Dick

Plaintext for Herman Melville's "Moby Dick"

Sons and Lovers

Plaintext for D. H. Lawrence's "Sons and Lovers"

Gridded World Population Density

UN-adjusted gridded world population density for the years 2000, 2005, 2010, and 2015

She

Plaintext for H. Rider Haggard's "She"

Oliver Twist

Plaintext for Charles Dickens' "Oliver Twist"

Greenmantle

Plaintext for John Buchan's "Greenmantle"

Sister Carrie

Plaintext for Theodore Dreiser's "Sister Carrie"

Rinkitink in Oz

Plaintext for L. Frank Baum's "Rinkitink in Oz"

Sample Data: GAG Urine Levels

Level of GAG in urine of children

Master of the World

Plaintext for Jules Verne's "Master of the World"

Tanglewood Tales

Plaintext for Nathaniel Hawthorne's "Tanglewood Tales"

Dust Frequency by WMO Station

Average annual and monthly number of days with dusty weather for Iran, Jordan, and Saudi Arabia

The Worst Journey in the World

Plaintext for Apsley Cherry­Garrard's "The Worst Journey in the World"

One of Ours

Plaintext for Willa Cather's "One of Ours"

C-BARQ Survey

The C-BARQ (or Canine Behavioral Assessment and Research Questionnaire) is designed to provide dog owners and professionals with standardized evaluations of canine temperament and behavior

Anthem

Plaintext for Ayn Rand's "Anthem"

Alice Adams

Plaintext for Booth Tarkington's "Alice Adams"

Sample Data: Cabbages

Data from 60 cabbages with measures of ascorbic acid content, weight, planting season and cultivar

The Patchwork Girl of Oz

Plaintext for L. Frank Baum's "The Patchwork Girl of Oz"

The Monk

Plaintext for Matthew Gregory Lewis' "The Monk"

Thunder Frequency by WMO Station

Average annual and monthly number of days with thunder

The Jewish State

Plaintext for Theodor Herzl's "The Jewish State"

The Stolen Bacillus and Other Incidents

Plaintext for H. G. Wells's "The Stolen Bacillus and Other Incidents"

The Sign of the Four

Plaintext for Arthur Conan Doyle's "The Sign Of The Four"

Seventeen

Plaintext for Booth Tarkington's "Seventeen"

The House of the Seven Gables

Plaintext for Nathaniel Hawthorne's "The House of the Seven Gables"

Power Density in Biological and Astronomical Systems

Power density in various systems in the universe for a comparison of their thermodynamics efficiency

Bartleby, the Scrivener: A Story of Wall­Street

Plaintext for Herman Melville's "Bartleby, the Scrivener: A Story of Wall­Street"

The Jewel of Seven Stars

Plaintext for Bram Stoker's "The Jewel of Seven Stars"

Twenty Thousand Leagues Under the Seas

Plaintext for Jules Verne's "Twenty Thousand Leagues Under the Sea"

An Enemy of the People

Plaintext for Henrik Ibsen's "An Enemy of the People"

Barnaby Rudge: A Tale of the Riots of 'Eighty

Plaintext for Charles Dickens' "A Tale of the Riots of 'Eighty"

On the Nature of Things

On the Nature of Things, by Lucretius

Black Beauty: His Grooms and Companions: The autobiography of a horse

Plaintext for Anna Sewell's "Black Beauty"

Of Human Bondage

Plaintext for W. Somerset Maugham's "Of Human Bondage"

Executions in the United States

A dataset about executions (the death penalty) in the United States since the 1976 Supreme Court decision in Gregg v. Georgia (428 U.S. 153)

The Mystery of Cloomber

Plaintext for Arthur Conan Doyle's "The Mystery of Cloomber"

Declaration of Independence

Text of the Declaration of Independence of the United States of America

The Return of the Native

Plaintext for Thomas Hardy's "The Return of the Native"

The Second Jungle Book

Plaintext for Rudyard Kipling's "The Second Jungle Book"

A Tale of Two Cities

Plaintext for Charles Dickens' "A Tale of Two Cities"

Pollyanna

Plaintext for Eleanor H. Porter's "Pollyanna"

Road Traffic Fatalities by Type 2010

Road traffic deaths by type of road user and country from 2010.

The Plastic Age

Plaintext for Percy Marks' "The Plastic Age"

Utopia

Plaintext for Sir Thomas More's "Utopia"

New Grub Street

Plaintext for George Gissing's "New Grub Street"

Child Mortality Numbers by Measles 2015

Child mortality numbers caused by measles by country.

Top Oil Fields 2001

Top producing oil fields in 2001

Tarzan the Untamed

Plaintext for Edgar Rice Burroughs' "Tarzan the Untamed"

Leviathan

Plaintext for Thomas Hobbes's "Leviathan"

Global Landslide Catalog

The Global Landslide Catalog considers all types of mass movements triggered by rainfall, which have been reported in the media, disaster databases, scientific reports, or other sources.

Simon Called Peter

Plaintext for Robert Keable's "Simon Called Peter"

Overcast Frequency by WMO Station

Average annual and monthly number of days without sunshine

The Turn of the Screw

Plaintext for Henry James's "The Turn of the Screw"

Dorothy Vernon of Haddon Hall

Plaintext for Charles Major's "Dorothy Vernon of Haddon Hall"

The Life and Adventures of Martin Chuzzlewit

Plaintext for Charles Dickens' "The Life and Adventures of Martin Chuzzlewit"

U.S. Baby Name Trends By State

A comprehensive list of time series for baby names in the U.S, listed by state, since 1910

U.S. Suicide Rates by County

The Underlying Cause of Death data available on the CDC's WONDER database are county-level national mortality and population data spanning the years 1999-2014. Data are based on death certificates for U.S. residents.

Beyond Thirty

Plaintext for Edgar Rice Burroughs' "Beyond Thirty"

The Valley of Fear

Plaintext for Arthur Conan Doyle's "The Valley of Fear"

Persuasion

Plaintext for Jane Austen's "Persuasion"

Le Cid

Plaintext for Pierre Corneille's "Le Cid"

Sample Image: Red Cherry Tomatoes

Sample Image in the CIE LAB Colorspace

Sample Data: Bone Marrow Leukemia

Bone marrow transplantation for Leukemia

Second Variety

Plaintext for Philip K. Dick's "Second Variety"

The Rosary

Plaintext for Florence L. Barclay's "The Rosary"

Rain Frequency by WMO Station

Average annual and monthly number of days with rain for 28 nations

Just David

Plaintext for Eleanor H. Porter's "Just David"

Little Dorrit

Plaintext for Charles Dickens' "Little Dorrit"

California Urban Water Supplier Monitoring Reports

Monthly reports of the larger urban water suppliers in California on water production and conservation activities, from the State of California's Drinking Water Information Clearinghouse (DRINC)

Kipps, the Story of a Simple Soul

Plaintext for H. G. Wells' "Kipps, the Story of a Simple Soul"

Daniel Deronda

Plaintext for George Eliot's "Daniel Deronda"

The War of the Worlds

Plaintext for H. G. Wells' "The War of the Worlds"

Dubliners

Plaintext for James Joyce's "Dubliners"

Child Mortality from HIV/AIDS

Child mortality numbers caused by HIV/AIDS by country.

Sample Data: Airplane Glass

Time to failure for airplane glass

Sample Data: Animal Weights

Brain and body weights for 28 animal species

US State Income

United States per capita income by state

The Rainbow

Plaintext for D.H. Lawrence's "The Rainbow"

Friends, Romans, Countrymen

Soliloquy by Mark Antony in Act III, scene II of William Shakespeare's "Julius Caesar"

The Magnificent Ambersons

Plaintext for Booth Tarkington's "The Magnificent Ambersons"

2016 EU Referendum Results

EU referendum results

Babbitt

Plaintext for Sinclair Lewis' "Babbitt"

Penrod

Plaintext for Booth Tarkington's "Penrod"

Women in Love

Plaintext for D. H. Lawrence's "Women in Love"

Sample Data: US City Temperature

Average monthly temperatures in degrees Fahrenheit, from January 1964 to December 1973 in 3 different US cities

Eight cousins

Plaintext for Louisa May Alcott's "Eight Cousins"

SQuAD v1.1 Tokens Generated with WL

A list of isolated words and symbols from the SQuAD dataset, which consists of a set of Wikipedia articles labeled for question answering and reading comprehension

Sample Data: Motor Failures

An accelerated life test at each of four temperatures of 10 motorettes.

Sample Data: Airline Passenger Miles

Revenue passenger miles flown by commercial airlines

Where Angels Fear to Tread

Plaintext for E. M. Forster's "Where Angels Fear to Tread"

Commuter-Adjusted Daytime Population by U.S. State

2006-2010 data on daytime commuting patterns in the U.S.

Snowfall Frequency by WMO Station

Average annual and monthly number of days with snowfall for 9 nations

Hamlet

Hamlet, Prince of Denmark, by William Shakespeare