Wolfram Research

The 2016 Atomic Mass Evaluation (AME2016)

Atomic mass list for analysis which contains the elements, mass excess, binding energy, beta-decay energy, atomic mass and more

Notes From Underground

Plaintext for Fyodor Dostoevsky's "Notes from Underground"

CIFAR-100

CIFAR-100 computer-vision training dataset

CIFAR-10

CIFAR-10 computer-vision training dataset

Sample Data: 1993 US Cars

Data from 93 cars, selected at random, on sale in the US in 1993 with 27 variables

Clinical Concepts from Massive Sources of Medical Data

A dataset of medical concepts

MycoDB

Data on plant response to mycorrhizal fungi

Patient Medical Data for Novel Coronavirus COVID-19

Medical records of patients infected with novel coronavirus COVID-19 (This data was imported and made computable on August 31, 2020.)

Sample Data: Abalone Measurements

Predict the age of abalone from physical measurements

Sample Audio: Apollo 11 One Small Step

Sample recording of Neil Armstrong's first words from the surface of the moon

U.S. Baby Name Trends By State

A comprehensive list of time series for baby names in the U.S, listed by state, since 1910

Theorem Network from Euclid's Elements

Graph of interdependence of theorems from Euclid's Elements

Sample Data: Solar System Planets and Moons

Sample dataset containing the mass and radius of planets and moons in the Solar System

NASA GISTEMP Global Means

Global-mean monthly, seasonal, and annual temperatures since 1880.

NASA GISTEMP Global Means dTs

Global-mean monthly, seasonal, and annual temperatures from dTs since 1880.

Child Mortality from Malaria

Child mortality numbers caused by malaria by country.

Cover Image from A New Kind of Science

The cellular automaton evolution shown on the cover of Stephen Wolfram’s A New Kind of Science

U.S. Baby Names By State

A comprehensive list of frequencies of baby names in the U.S, listed by year and state, since 1910

Child Mortality from HIV/AIDS

Child mortality numbers caused by HIV/AIDS by country.

Paleoclimate Data Records Derived from the Vostok Ice Core

Datasets of CO2 concentration and temperature historical records derived from the air and isotopes trapped in the ice core

Sample Data: Satellite

Classify the type of land surface of a scene photographed by the Landsat MSS satellite given four digital images of the scene taken in different spectral bands

Global Landslide Catalog

The Global Landslide Catalog considers all types of mass movements triggered by rainfall, which have been reported in the media, disaster databases, scientific reports, or other sources.

Sample Data: University Salaries

Salaries from three university campuses

Sample Data: UK Lung Disease Deaths

Monthly deaths from lung diseases in the UK

Tornadoes in the U.S., 1950-2015

Tornadoes tracked by NOAA from 1950-2015

D'Arcy Thompson Zoology Museum 3D Anatomical Models

Models from the museum at the University of Dundee, Scotland

Hue Color Gradients

Collection of gradient hues from coolHue

Olympic Games Costs

Costs of the Olympic Games held from 1960 to 2014

Stack Overflow Survey 2016

Results from Stack Overflow's 2016 Developer Survey

Sample Data: Time to AIDS Induction

Time to AIDS in years for adults and children from 1978

US County Suicide Data 1999-2013

Data on suicides by US County from 1999-2013

Wind Storms in the U.S., 1955-2015

Wind Storms tracked by NOAA from 1955-2015

Free Association Norms Network

Network showing the results from a free association norm experiment

LifeWiki Entity Store

More than 1,000 entities from LifeWiki

Europarl English-Spanish Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

United States Supreme Court Decisions 1946-present

Datasets relating to Supreme Court cases from 1946 to present

Europarl English-German Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

State of the Union Addresses

Corpus of all the State of the Union addresses from 1790 to 2019.

HCAHPS Patient Care Survey

Responses from a standardized survey on the quality of American hospital care

Europarl English-Italian Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Europarl English-French Machine Translation Dataset V7

A parallel corpus for machine translation from the proceedings of the European Parliament

Video Games Until April 2017

Dataset from the Internet Games Database API as of April 2017

Video Games

Dataset from the Internet Games Database API as of June 2018

The Big Mac Index

The Big Mac index, published by The Economist from 2000 to 2018.

Urbana Police Stop Sheets

Police Stop Sheet results from the City of Urbana after 2016

SQuAD v2.0

A dataset for question answering and reading comprehension from a set of Wikipedia articles

Road Traffic Fatalities by Type 2013

Road traffic deaths by type of road user and country from 2013.

SQuAD v1.1

A dataset for question answering and reading comprehension from a set of Wikipedia articles

Road Traffic Fatalities by Type 2010

Road traffic deaths by type of road user and country from 2010.

NYC Rat Sightings

New York City rat sighting service requests, 2010-2016

Nuclear Latency Dataset

Facility-specific information on sensitive nuclear plants constructed from 1939 to 2012

Urbana Police Traffic Stops

Urbana Police traffic stop motivation information from 2012 to September 2018

Structure of Euclid's Elements

Textual information of definitions, common notions, postulates, and theorems from Euclid’s Elements

UK Government Wine and Spirits Consumed

Table of all Wine and Spirits consumed from the Government Hospitality Wine Cellar

Stanford Bunny

3D test model created from scanning a ceramic figurine of a rabbit

Wikipedia Voting Data 2001-2008

Graph of Wikipedia voting data from the inception of Wikipedia till January 2008

Western Europe Grape Harvest

Western Europe 650 year Grape Harvest Data from 1354 to 2007

US Fatal Injuries 1999-2014

Dataset of deaths and crude rates of fatal injuries in the United States from 1999 to 2014

Global Events of Organized Violence

Georeferenced dataset of individual events of organized violence from the Uppsala Conflict Data Program

Meteorite Landings

This comprehensive data set from The Meteoritical Society contains information on all of the known meteorite landings.

Rejected New York License Plates

Data on New York personalized license plate applications received from the New York DMV

Published Papers Per Year on Cellular Automata

A time series of the number of papers published on cellular automata by year from 1974 through 2015

Sample Data: US Earthquakes

Earthquakes in the U.S. from July 28th, 1789 to March 24th, 2010

Sample Data: Cabbages

Data from 60 cabbages with measures of ascorbic acid content, weight, planting season and cultivar

Seven Year Microwave Sky

The detailed, all-sky picture of the infant universe created from seven years of WMAP data

Sample Data: Rainfall Seeding

Rainfall in acre-feet from 52 clouds, of which 26 were chosen randomly and seeded with silver oxide.

Fireballs and Bolides

Data on several of the brightest fireballs and bolides that were detected from 2009-2015 by U.S. Government sensors

Sample Data: US City Temperature

Average monthly temperatures in degrees Fahrenheit, from January 1964 to December 1973 in 3 different US cities

Sample Data: Life Cycle Savings

Savings data averaged over the decade from 1960 to 1970 to remove the business cycle or other short term fluctuations.

Sample Data: Crab Measures

Five morphological measurements of two varieties of both sexes of crab species Leptograpsus variegatus from Fremantle, W. Australia.

Coronavirus COVID-19 Pandemic Government Measures

Measures taken by governments from different countries to fight the coronavirus COVID-19 pandemic caused by the SARS-CoV-2

Lorem Ipsum

"Lorem Ipsum" is filler text in graphic design and publishing derived from Cicero's "On the ends of good and evil", section 1.10.33.

Orbital Variations and Insolation Database

Dataset of insolation values at different latitudes from 5000000 cal yr BP (-4998050 CE) to 0 cal yr BP (1950 CE)

Raw Data For The Long Term Selection Experiment For Oil And Protein In Corn

Raw data from each ear analyzed each year of the Illinois long-term selection experiment for oil and protein in corn (1896-2004)

SQuAD v2.0 Tokens Generated with WL

A list of isolated words and symbols from the SQuAD dataset, which consists of a set of Wikipedia articles labeled for question answering and reading comprehension

California Urban Water Supplier Monitoring Reports

Monthly reports of the larger urban water suppliers in California on water production and conservation activities, from the State of California's Drinking Water Information Clearinghouse (DRINC)

SQuAD v1.1 Tokens Generated with WL

A list of isolated words and symbols from the SQuAD dataset, which consists of a set of Wikipedia articles labeled for question answering and reading comprehension

Sample Data: Earthquake Waiting Times

The time in days between serious (magnitude at least 7.5 or over 1000 fatalities) earthquakes worldwide, from 12/16/1902 to 3/4/1977

Project Tycho Level 1 Data

Data from all weekly notifiable disease reports for the United States dating back to 1888. Level 1 data have been custom tailored for specific analyses.