Wolfram Research

National Science Foundation Grants - 2015

Data on National Science Foundation grants and associated investigators and institutions awarded in the year 2015

Sample Data: Scottish Hill Races

Record times in Scottish hill races

Repetition Periods for Elementary Cellular Automata

A collection of rules and their repetition periods as a function of size

Theorem Network from Euclid's Elements

Graph of interdependence of theorems from Euclid's Elements

Sample Data: Speed of Light

Michelson and Morley's speed of light data

Sample Data: Mushroom Classification

Determine whether a mushroom is edible based on physical characteristics

NYC Jobs, Aug 2015 - Aug 2016

Information about NYC jobs for Aug. 2015 - Aug. 2016

Sample Data: Esophageal Cancer

Case-control study of esophageal cancer in Ille-et-Vilaine

Prussian Horse Kick Data

Number of soldiers in the Prussian cavalry killed by horse kicks

FDIC Institution EntityStore

A Wolfram Language EntityStore with selected data on FDIC insured institutions

California Urban Water Supplier Monitoring Reports

Monthly reports of the larger urban water suppliers in California on water production and conservation activities, from the State of California's Drinking Water Information Clearinghouse (DRINC)

Sample Data: Anscombe Regression Lines

Anscombe's 4 regression line data

Sample Data: Earthquake Waiting Times

The time in days between serious (magnitude at least 7.5 or over 1000 fatalities) earthquakes worldwide, from 12/16/1902 to 3/4/1977

Sample Data: Puromycin Reaction Velocity

Velocities of enzymatic reactions with or without Puromycin.

Project Tycho Level 1 Data

Data from all weekly notifiable disease reports for the United States dating back to 1888. Level 1 data have been custom tailored for specific analyses.

Timeline of Systematic Data & Computable Knowledge

Dataset of nearly 200 notable events in the history of computable knowledge

Sample Data: Channing House

The use of counting process methodology has allowed for substantial advances in the statistical theory to account for censoring and truncation in survival experiments. This book makes these complex methods more access...

Sample Data: Larynx Cancer

The use of counting process methodology has allowed for substantial advances in the statistical theory to account for censoring and truncation in survival experiments. This book makes these complex methods more access...

Sample Data: Psychiatric Patient Deaths

Death times of psychiatric patients

Sample Data: Bone Marrow Leukemia

Bone marrow transplantation for Leukemia

USDA Aggregate Tenant Data on Active Properties

Demographics and information on the USDA's active tenant properties

Sample Data: US Earthquakes

Earthquakes in the U.S. from July 28th, 1789 to March 24th, 2010

Tweets by @WolframResearch (Datasets for Multiparadigm Data Science Course)

These datasets are used in the example demonstrated in the "Build a Project Workflow" section of the "Multiparadigm Data Science" interactive course on Wolfram U

Sample Data: 1993 US Cars

Data from 93 cars, selected at random, on sale in the US in 1993 with 27 variables

Tweets by @WolframResearch

Data used in the Wolfram U interactive course “Multiparadigm Data Science”

Genetic Sequences for the SARS-CoV-2 Coronavirus

Nucleotide sequences of the SARS-CoV-2 virus (the virus associated with the COVID-19 disease, formerly known as 2019-nCoV) including location, collection time and similar supporting data

Protein Sequences for the SARS-CoV-2 Coronavirus

Protein sequences of the SARS-CoV-2 virus (the virus associated with the COVID-19 disease, formerly known as 2019-nCoV) including location, collection time and similar supporting data

Clinical Concepts from Massive Sources of Medical Data

A dataset of medical concepts

Patient Medical Data for Novel Coronavirus COVID-19

Medical records of patients infected with novel coronavirus COVID-19 (This data was imported and made computable on August 31, 2020.)

MycoDB

Data on plant response to mycorrhizal fungi

The 2016 Atomic Mass Evaluation (AME2016)

Atomic mass list for analysis which contains the elements, mass excess, binding energy, beta-decay energy, atomic mass and more

Equity in U.S. GED Programs

A report on the equity of school districts and their GED programs across the U.S. during the 2013-14 school year

Sample Data: Scientific Discoveries

Number of great scientific discoveries of each year

Coauthorships in Network Science Network

Weighted network of coauthorships between scientists working on network science

Sample Data: Ozone Concentrations

Daily maximum ozone concentrations at Stamford, Connecticut and Yonkers, New York, during the period May 1, 1974 to September 30, 1974, recorded in parts per billion (ppb).

Sample Data: Rainfall Seeding

Rainfall in acre-feet from 52 clouds, of which 26 were chosen randomly and seeded with silver oxide.

Sample Data: Buffalo Snow

Snow accumulations in Buffalo

Australian Rules Football - 2018 Final Team Rankings

This dataset consists of the final team standing within the Australian Rules Football league for the 2018 season

US Hurricane Losses

U.S. losses due to hurricanes in millions of U.S. Dollars

Full Text of A New Kind of Science

The full text of Stephen Wolfram’s A New Kind of Science

Sample Data: US City Temperature

Average monthly temperatures in degrees Fahrenheit, from January 1964 to December 1973 in 3 different US cities

Spacecraft Materials Outgassing Data

Data was obtained at the Goddard Space Flight Center (GSFC), utilizing equipment developed at Stanford Research Institute (SRI) under contract to the Jet Propulsion Laboratory (JPL).

Cover Image from A New Kind of Science

The cellular automaton evolution shown on the cover of Stephen Wolfram’s A New Kind of Science

People Mentioned in Stephen Wolfram’s “A New Kind of Science”

Listing of all people mentioned in historical and other notes in “A New Kind of Science”

Kepler-11 Light Curve Data

Light curve data for planetary system Kepler-11

NYC Major Felony Incidents 2015

Quarterly update of Seven Major Felonies at the incident level. For privacy reasons, incidents have been moved to the midpoint of the street segment on which they occur.

NYC Rat Sightings

New York City rat sighting service requests, 2010-2016

Entity Store of People Mentioned in Stephen Wolfram's A New Kind of Science

An entity store of all people mentioned in historical and other notes in “A New Kind of Science”

Rejected New York License Plates

Data on New York personalized license plate applications received from the New York DMV

Space Shuttle 3D Model

3D test model of a space shuttle

Viking Lander 3D Model

3D test model of a Viking Mars lander

Stanford Bunny

3D test model created from scanning a ceramic figurine of a rabbit

Data for Maymin & Langer (2021) "Cognitive Biases and Mindfulness"

Survey data generated and analysed in the article "Cognitive Biases and Mindfulness" by Philip Z. Maymin and Ellen J. Langer in the Nature journal Humanities and Social Sciences Communications

Global Events of Organized Violence

Georeferenced dataset of individual events of organized violence from the Uppsala Conflict Data Program

Books in Stephen Wolfram's Library

Books in Stephen Wolfram’s library that were used during the creation of A New Kind of Science (Wolfram, 2002).

Epidemic Data for Novel Coronavirus COVID-19

Estimated cases of novel coronavirus (COVID-19, formerly known as 2019-nCoV) infection by country or region (This data was imported and made computable on May 10, 2021)

Sample Data: Satellite

Classify the type of land surface of a scene photographed by the Landsat MSS satellite given four digital images of the scene taken in different spectral bands

Entity Store of Books in Stephen Wolfram's Library

An entity store of books in Stephen Wolfram’s library that were used during the creation of A New Kind of Science (Wolfram, 2002).

Sample Data: Fisher's Irises

Fisher's iris data

USA Google Mobility Data

Google Mobility Reports for the USA

Minimal Inequivalent Square Tilings

A dataset of images and constraints for the minimal inequivalent square tilings, along with the allowed tiles that generate the tiling

NYC Emergency Response Incidents

NYC Open Data makes the wealth of public data generated by various New York City agencies and other City organizations available for public use. This catalog offers access to a repository of government-produced, machi...

Sample Data: Stackloss Plant

Brownlee's stack loss plant data.

Little Fuzzy

Plaintext for H. Beam Piper's "Little Fuzzy"

Irish-Viking Networks in 'Cogadh Gaedhel re Gallaibh'

Graph datasets for Irish and Viking character relationships in the medieval Irish text 'Cogadh Gaedhel re Gallaibh' ('The War of the Gaedhil with the Gaill')

Sample Data: Australian AIDS

Data on patients diagnosed with AIDS in Australia before July 1, 1991

Sample Data: Chicken Weight

Biometrika is primarily a journal of statistics in which emphasis is placed on papers containing original theoretical contributions of direct or potential value in applications.

Canonical Polyhedra

The canonical forms of polyhedra with 4 to 9 faces

Sample Data: Soporific Drugs

Data showing the effect of two soporific drugs on 10 patients, measured as the increase in hours of sleep compared to control.

Thuvia, Maid of Mars

Plaintext for Edgar Rice Burroughs' "Thuvia, Maid of Mars"

Sample Data: Employee Attitude Survey

Employee attitude data for 30 departments in a large financial organization

Sample Data: Car Stopping Distances

Data on the relation between the speed of the car and the distance for the car to stop.

Sample Data: Water Boiling Points in the Alps

17 observations on the boiling point of water and barometric pressure in inches of mercury.

Dr. Jekyll and Mr. Hyde

Plaintext for Robert Louis Stevenson's "Dr. Jekyll and Mr. Hyde"

Twenty Thousand Leagues Under the Seas

Plaintext for Jules Verne's "Twenty Thousand Leagues Under the Seas"

Sample Data: Wolf Sunspot Numbers

Data for yearly averages of sunspot numbers originally compiled by Johann Rudolf Wolf (1770-1869).

Sample Data: Life Cycle Savings

Savings data averaged over the decade from 1960 to 1970 to remove the business cycle or other short term fluctuations.

A Voyage to Arcturus

Plaintext for David Lindsay's "A Voyage to Arcturus"

The Invisible Man

Plaintext for H. G. Wells' "The Invisible Man"

The Time Machine

Plaintext for H. G. Wells' "The Time Machine"

Master of the World

Plaintext for Jules Verne's "Master of the World"

Sample Data: Cabbages

Data from 60 cabbages with measures of ascorbic acid content, weight, planting season and cultivar

Sample Data: Plant Growth

Dried plant weights used to compare yields across one control group and two treatment groups.

Sample Data: Formaldehyde Statistics

Formaldehyde concentration analysis obtained in a chemical experiment.

Sample Data: Singer Heights

Heights in inches of the singers in the New York Choral Society in 1979 grouped by their voice parts.

Sample Data: Ceramic Strength

Effect of machining factors on the strength of ceramics

Sample Data: Airplane Glass

Time to failure for airplane glass

Anthem

Plaintext for Ayn Rand's "Anthem"

Sample Data: Anorexia Treatment

Anorexia data on weight change for young female patients.

Persistent Structures in the Code 357 Cellular Automaton

A collection of the persistent structures in the k=3, r=1 totalistic code 357 cellular automaton

Block Simulation Network of Elementary Cellular Automata

A network showing how one elementary cellular automaton can emulate another if its states contain only particular blocks

Persistent Structures in the Code 20 Cellular Automaton

The known persistent structures in the k=2, r=2 totalistic code 20 cellular automaton

Persistent Structures in the Code 1329 Cellular Automaton

A collection of the persistent structures in the k=3, r=1 totalistic code 1329 cellular automaton

Sample Data: Animal Weights

Brain and body weights for 28 animal species

Sample Data: Indomethicin Pharmacokinetics

Time series data on the concentration of indomethacin in 6 subjects in the indomethacin pharmacokinetics experiment.

Sample Data: Time to AIDS Induction

Time to AIDS in years for adults and children from 1978

Sample Data: Swiss Bank Notes

Six measurements made on 100 genuine Swiss bank notes and 100 counterfeit ones.

Sample Data: DNase Assay

Data about the ELISA assay for the recombinant protein DNase in rat serum.

Orbits of a Planet in a Binary Star System

A collection of orbits for an idealized planet in a binary star system

Sample Data: Female Heights And Weights

Data on the average heights and weights for American women aged 30 to 39.

NYC Motor Vehicle Collisions

Motor vehicle collisions between road users reported to the NYPD.

Beyond Lies the Wub

Plaintext for Philip K. Dick's "Beyond Lies the Wub"

Sample Data: Motor Failures

An accelerated life test of 10 motorettes at each of four temperatures.

Fireballs and Bolides

Data on several of the brightest fireballs and bolides that were detected from 2009-2015 by U.S. Government sensors

Sample Data: US Arrests

Number of arrests per 100,000 residents for assault, murder and rape in each of 50 U.S. states in 1973 and percentage of population living in urban areas for each state

Sample Data: CPU Performance

A relative performance measure and characteristics of 209 CPUs.

Sample Data: Airline Passenger Miles

Revenue passenger miles flown by commercial airlines

Sample Data: University Salaries

Salaries from three university campuses

Sample Data: Warp Breaks

Number of breaks in yarn during weaving per loom.

Sample Data: Fisher's Cats

The heart and body weights of samples of male and female cats used for digitalis experiments. The cats were all adult, over 2 kg body weight.

Sample Data: Swiss Fertility

Swiss fertility and socioeconomic indicators (1888)

Sample Data: Crab Measures

Five morphological measurements of two varieties of both sexes of crab species Leptograpsus variegatus from Fremantle, W. Australia.

Sample Data: Gilgai Soil

Line transect of soil in Gilgai territory.

Sample Data: Old Faithful Eruptions

Applied Statistics is a journal of international repute for statisticians both inside and outside the academic world.

Sample Data: Black Cherry Trees

Girth, height and volume of black cherry trees

Sample Data: Cushing's Syndrome Testing

Diagnostic tests on patients with Cushing's syndrome

Global Landslide Catalog

The Global Landslide Catalog considers all types of mass movements triggered by rainfall, which have been reported in the media, disaster databases, scientific reports, or other sources.

Sample Data: Beaver Body Temperatures

Body temperature series for a female beaver

Sample Data: Mercury Vapor Pressure

Relation between mercury vapor pressure and temperature.

Western Europe Grape Harvest

Western Europe 650 year Grape Harvest Data from 1354 to 2007

Sample Data: 2003 US Life Table

A life table for the total United States population in 2003

Sample Data: Guinea Pig Tooth Growth

Data on the length of odontoblasts (teeth) for 10 guinea pigs measured at each of three dose levels of Vitamin C with each of two delivery methods.

Solid Waste Landfill Facilities

Oak Ridge National Laboratory is the largest US Department of Energy science and energy laboratory, conducting basic and applied research to deliver transformative solutions to compelling problems in energy and security.

Sample Data: UCI Letter

Letter recognition dataset

Sample Data: Boston Homes

Housing values in suburbs of Boston

Sample Data: Kidney Transplant

Time to death of 863 kidney transplant patients.

Sample Data: Otitis Media

The effects of a drug on 50 children with a history of otitis media in the Northern Territory of Australia.

Tagged Test Images Network

Bipartite network of tags and images

Persistent Structures in Rule 110

The known families of persistent structures in the Rule 110 elementary cellular automaton

Sample Data: GAG Urine Levels

Level of GAG in urine of children

Three-Color Cellular Automaton Rules that Double Their Input

A list of rules for k=3 cellular automata that eventually double a block of gray input cells

Sample Data: Cement Heat Evolution

Heat evolved by setting cements

Sample Data: Mark Twain Authorship

Distribution of word lengths for Mark Twain and Quintus Curtius Snodgrass.

Near-Earth Comets

J2000 heliocentric ecliptic orbital elements of 160 Near-Earth Comets

Stack Overflow Survey 2016

Results from Stack Overflow's 2016 Developer Survey

Sample Data: Birth Weight Risk

9 potential risk factors for low birth weight with birth weight outcomes.

Sample Data: Pacific Walrus Haulouts

Congregations of Pacific walruses off the coasts of the U.S. and Russia, 1852-2016

Sample Data: Wine Quality

Quality of white wines given the physical properties of the wines

Atlantic Hurricane Data 1851-2017

A modification of the NOAA "Hurdat2" Dataset on Atlantic Hurricanes to facilitate use with the Wolfram Language

Washington, D.C. Metro Bus Stops

The District provides a large quantity of government information available to the public. The Open Data Catalog provides hundreds of District government datasets, available as raw downloads in a variety of formats, an...

Washington, D.C. Metro Stations

The District provides a large quantity of government information available to the public. The Open Data Catalog provides hundreds of District government datasets, available as raw downloads in a variety of formats, an...

Solutions to Examples of Post’s Correspondence Problem

A dataset of instances and solutions (if they exist) for Post’s correspondence problem

Sample Data: Abalone Measurements

Predict the age of abalone from physical measurements

Sample Data: Movie Review Sentence Polarity

Movie review data

City of Champaign Street Signs

The various street signs of Champaign, IL. Data on signs include ownership, size, type, etc.

New York City Elevators

List of registered elevator devices in New York City

On the Origin of Species

On the Origin of Species By Means of Natural Selection, or, the Preservation of Favoured Races in the Struggle for Life, by Charles Darwin

Epidemic Data for SARS

Cumulative number of reported suspect and probable cases

Top Oil Fields 2001

Top producing oil fields in 2001

Sample Data: Spam Email

Dataset of email statistics for the classification of spam email

Sample Data: Car Evaluation

Predicting car acceptability by attribute.

Large Global Plate Boundaries

Locations and other attributes of boundaries between tectonic plates

US Coal Fields

This dataset represents coal fields in Alaska and the conterminous United States.

Sample Data: Loblolly Tree Growth

Loblolly pine tree growth measurement

Wikipedia Voting Data 2001-2008

Graph of Wikipedia voting data from the inception of Wikipedia till January 2008

Sample Data: Gene Sequences

Splice-junction Gene Sequences for Primate DNA

US County Suicide Data 1999-2013

Data on suicides by US County from 1999-2013

US State Income

United States per capita income by state

US Health Data Breaches

A list of breaches of unsecured protected health information affecting 500 or more individuals

Kyoto Free Translation Task Data

A parallel corpus for the evaluation and development of Japanese-English machine translation systems

US State Fairgrounds

The National Geospatial-Intelligence Agency (NGA) delivers geospatial intelligence that provides a decisive advantage to policymakers, warfighters, intelligence professionals and first responders.

U.S. State Fairgrounds

Locations for United States State and Regional Fairs

Sample Data: Mink Fur Sales

Annual mink fur sales of Hudson's Bay Company

Sample Data: Lymphoma Marrow Transplants

Bone marrow transplants for Hodgkin's and non-Hodgkin's lymphoma patients

Sample Data: UK Lung Disease Deaths

Monthly deaths from lung diseases in the UK

Sample Data: Solar System Planets and Moons

Sample dataset containing the mass and radius of planets and moons in the Solar System

Raw Data For The Long Term Selection Experiment For Oil And Protein In Corn

Raw data from each ear analyzed each year of the Illinois long-term selection experiment for oil and protein in corn (1896-2004)

US Census Mean Household Income Data

Census Bureau

OECD Data: Hospital Beds Per Country

OECD time series data for number of hospital beds per 1000 inhabitants

Paleoclimate Data Records Derived from the Vostok Ice Core

Datasets of CO2 concentration and temperature historical records derived from the air and isotopes trapped in the ice core

Orbital Variations and Insolation Database

Dataset of insolation values at different latitudes from 5000000 cal yr BP (-4998050 CE) to 0 cal yr BP (1950 CE)

1918 'Spanish Flu' Pandemic In Chicago

Point locations of influenza and pneumonia deaths and weekly mortality data recorded during the 1918 'Spanish flu' pandemic, correlated to 1920 census data for Chicago

Actinobacillus Actinomycetemcomitans Metabolic Network

Metabolic cellular network data

Synechocystis Sp Whole Network

Whole cellular network data

Mycoplasma Pneumoniae Metabolic Network

Metabolic cellular network data

Arabidopsis Thaliana Whole Network

Whole cellular network data

Chlamydia Pneumoniae Whole Network

Whole cellular network data

Streptococcus Pyogenes Metabolic Network

Metabolic cellular network data

Mycobacterium Leprae Metabolic Network

Metabolic cellular network data

Clostridium Acetobutylicum Whole Network

Whole cellular network data

Porphyromonas Gingivalis Metabolic Network

Metabolic cellular network data

Caenorhabditis Elegans Metabolic Network

Metabolic cellular network data

Salmonella Typhi Whole Network

Whole cellular network data

Saccharomyces Cerevisiae Metabolic Network

Metabolic cellular network data

Porphyromonas Gingivalis Whole Network

Whole cellular network data

Emericella Nidulans Metabolic Network

Metabolic cellular network data

Oryza Sativa Whole Network

Whole cellular network data

Enterococcus Faecalis Whole Network

Whole cellular network data

Treponema Pallidum Metabolic Network

Metabolic cellular network data

Methanobacterium Thermoautotrophicum Metabolic Network

Metabolic cellular network data

Chlamydia Trachomatis Whole Network

Whole cellular network data

Rhodobacter Capsulatus Whole Network

Whole cellular network data

Archaeoglobus Fulgidus Whole Network

Whole cellular network data

Helicobacter Pylori Whole Network

Whole cellular network data

Mycobacterium Tuberculosis Whole Network

Whole cellular network data

Neisseria Gonorrhoeae Metabolic Network

Metabolic cellular network data

Rickettsia Prowazekii Metabolic Network

Metabolic cellular network data

Salmonella Typhi Metabolic Network

Metabolic cellular network data

Aquifex Aeolicus Metabolic Network

Metabolic cellular network data

Pyrococcus Horikoshii Metabolic Network

Metabolic cellular network data

Pyrococcus Horikoshii Whole Network

Whole cellular network data

Thermotoga Maritima Whole Network

Whole cellular network data

Thermotoga Maritima Metabolic Network

Metabolic cellular network data

Pyrococcus Furiosus Metabolic Network

Metabolic cellular network data

Enterococcus Faecalis Metabolic Network

Metabolic cellular network data

Yersinia Pestis Metabolic Network

Metabolic cellular network data

Neisseria Meningitidis Whole Network

Whole cellular network data

Bacillus Subtilis Metabolic Network

Metabolic cellular network data

Aeropyrum Pernix Whole Network

Whole cellular network data

Actinobacillus Actinomycetemcomitans Whole Network

Whole cellular network data

Treponema Pallidum Whole Network

Whole cellular network data

Chlamydia Pneumoniae Metabolic Network

Metabolic cellular network data

Pseudomonas Aeruginosa Metabolic Network

Metabolic cellular network data

Chlorobium Tepidum Metabolic Network

Metabolic cellular network data

Mycobacterium Tuberculosis Metabolic Network

Metabolic cellular network data

Saccharomyces Cerevisiae Whole Network

Whole cellular network data

Pyrococcus Furiosus Whole Network

Whole cellular network data

Haemophilus Influenzae Whole Network

Whole cellular network data

Bacillus Subtilis Whole Network

Whole cellular network data

Pseudomonas Aeruginosa Whole Network

Whole cellular network data

Streptococcus Pyogenes Whole Network

Whole cellular network data

Clostridium Acetobutylicum Metabolic Network

Metabolic cellular network data

Methanobacterium Thermoautotrophicum Whole Network

Whole cellular network data

Neisseria Meningitidis Metabolic Network

Metabolic cellular network data

Deinococcus Radiodurans Whole Network

Whole cellular network data

Campylobacter Jejuni Whole Network

Whole cellular network data

Mycobacterium Bovis Metabolic Network

Metabolic cellular network data

Campylobacter Jejuni Metabolic Network

Metabolic cellular network data

Chlamydia Trachomatis Metabolic Network

Metabolic cellular network data

Streptococcus Pneumoniae Whole Network

Whole cellular network data

Escherichia Coli Whole Network

Whole cellular network data

Borrelia Burgdorferi Whole Network

Whole cellular network data

Archaeoglobus Fulgidus Metabolic Network

Metabolic cellular network data

Borrelia Burgdorferi Metabolic Network

Metabolic cellular network data

Deinococcus Radiodurans Metabolic Network

Metabolic cellular network data

Streptococcus Pneumoniae Metabolic Network

Metabolic cellular network data

Oryza Sativa Metabolic Network

Metabolic cellular network data

Synechocystis Sp Metabolic Network

Metabolic cellular network data

Methanococcus Jannaschii Metabolic Network

Metabolic cellular network data

Mycoplasma Genitalium Whole Network

Whole cellular network data

Neisseria Gonorrhoeae Whole Network

Whole cellular network data

Yersinia Pestis Whole Network

Whole cellular network data

Chlorobium Tepidum Whole Network

Whole cellular network data

Caenorhabditis Elegans Whole Network

Whole cellular network data

Methanococcus Jannaschii Whole Network

Whole cellular network data

Rhodobacter Capsulatus Metabolic Network

Metabolic cellular network data

Helicobacter Pylori Metabolic Network

Metabolic cellular network data

Escherichia Coli Metabolic Network

Metabolic cellular network data

Haemophilus Influenzae Metabolic Network

Metabolic cellular network data

Mycobacterium Leprae Whole Network

Whole cellular network data

Arabidopsis Thaliana Metabolic Network

Metabolic cellular network data

Emericella Nidulans Whole Network

Whole cellular network data

Mycoplasma Genitalium Metabolic Network

Metabolic cellular network data

Mycobacterium Bovis Whole Network

Whole cellular network data

Aeropyrum Pernix Metabolic Network

Metabolic cellular network data

Mycoplasma Pneumoniae Whole Network

Whole cellular network data

NASA GISTEMP Global Means dTs

Global-mean monthly, seasonal, and annual temperatures from dTs since 1880.

Polyform Database

Data for some of the most popular polyforms

Amniote Life History EntityStore

EntityStore of life history data for birds, mammals, and reptiles

U.S. Suicide Rates by County

The Underlying Cause of Death data available on the CDC's WONDER database are county-level national mortality and population data spanning the years 1999-2014. Data are based on death certificates for U.S. residents.

Amniote Life History Database

Life-history database for a wide variety of amniotes

Power Density in Biological and Astronomical Systems

Power density in various systems in the universe for a comparison of their thermodynamic efficiency

US Public Housing Authorities 2016

2016 HUD public housing authority data

GMM-3 Mars Gravity Map

Goddard Mars Model 3 map of the gravity field of Mars

USDA Rural Housing Active Projects

Data on the USDA's rental properties and labor housing types

Disease Gene Network

A network of disease genes linked by known disorder-gene associations

Nematode Pharynx Graph

Connectome of anterior section of hermaphrodite Caenorhabditis elegans pharynx

High Energy Theory Collaborations Network

Weighted network of collaborations between scientists posting preprints

Protein Interaction network

Protein interaction network

Transcriptional Regulation Network of Escherichia coli

Dataset of the transcriptional regulation network of Escherichia coli

Rat Brain Graph 1

Rattus norvegicus brain network graph

Condensed Matter Collaborations 2005 Network

Updated weighted network of collaborations between scientists posting preprints

Cell Ontology Network

Network of ontology for cell types

High Energy Physics Theory Network

A collaboration and citation network on physics theory

Rhesus Cerebral Cortex Graph 1

Connectome of rhesus macaque cerebral cortex

Condensed Matter Collaborations 1999 Network

Weighted network of collaborations between scientists posting preprints

High Energy Physics Phenomenology Network

A collaboration and citation network on physics phenomenology

Condensed Matter Collaborations 2003 Network

Updated weighted network of collaborations between scientists posting preprints

Budding Yeast Network

Protein-protein interaction network in budding yeast

Medicare Drug Spending

Detailed data on prescription drugs for Medicare Part B and Part D

Commuter-Adjusted Daytime Population by U.S. County

2006-2010 data on daytime commuting patterns in the U.S.

Commuter-Adjusted Daytime Population by U.S. Place

2006-2010 data on daytime commuting patterns in the U.S.

Commuter-Adjusted Daytime Population by U.S. State

2006-2010 data on daytime commuting patterns in the U.S.

United States Hail Storms 1955-2015

Information on severe hail storms in the US, including injury and property loss data

NCI Standard Anticancer Agents

Dataset of the NCI standard anticancer agents

SWEETLEAD Molecule Database

A cheminformatics database of medicines, drugs, and herbal isolates

Meteorite Landings

This comprehensive data set from The Meteoritical Society contains information on all of the known meteorite landings.

Coastline Fractal Dimensions

Fractal dimensions of coastlines of all countries, dependencies, and territories

Linear Codes

Linear codes provide an optimal way for transmitting blocks of data over noisy channels

Bortle scale

Numeric scale that measures the night sky's brightness

Census Tract Entity Store

US Census tracts with location, polygon, and 5-year US Census Bureau ACS data

VEHICLe

Virtual exploratory heterocyclic library

NASA GISTEMP Global Means

Global-mean monthly, seasonal, and annual temperatures since 1880.

Pitt Quantum Repository

The Pitt Quantum Repository is a database and website of molecular quantum calculations, including visualization

D'Arcy Thompson Zoology Museum 3D Anatomical Models

Models from the museum at the University of Dundee, Scotland

Dust Frequency by WMO Station

Average annual and monthly number of days with dusty weather for Iran, Jordan, and Saudi Arabia

Thunder Frequency by WMO Station

Average annual and monthly number of days with thunder

Overcast Frequency by WMO Station

Average annual and monthly number of days without sunshine

Rain Frequency by WMO Station

Average annual and monthly number of days with rain for 28 nations

Snowfall Frequency by WMO Station

Average annual and monthly number of days with snowfall for 9 nations

Hospital Beds Per US State

Time series data for number of hospital beds per 1000 inhabitants in each US state, by ownership type

Sea Level and Temperatures Over the Last 40 Million Years

Dataset of eustatic sea level and temperatures over the last 40 million years

UK Occupation Estimates: Exposure to Generic Disease and Physical Proximity

An estimate of exposure to disease (generally) and physical proximity for UK occupations based on US analysis of these factors, using 2019 data

Seven Year Microwave Sky

The detailed, all-sky picture of the infant universe created from seven years of WMAP data

Wind Speed Measurements

Average daily wind speed at 12 meteorological stations in the Republic of Ireland 1961-1978

UK Crime Incidents, February 2017

Individual crime and anti-social behaviour (ASB) incidents in England, Wales, and Northern Ireland in February 2017

Detailed Average Prices of Consumer Goods in Europe

Average prices per country of well-defined consumer goods and services

State Government Finances 2013

U.S. Census Bureau, 2013 Annual Survey of State Government Finances

CDC's Social Vulnerability Index (SVI) 2018

A collection of social vulnerability factors for each US county

Primitive Polynomials

Primitive polynomials for Galois field generation up to GF(2^1200), GF(3^660), GF(5^430), and GF(7^358)

Urbana Police Traffic Stops

Urbana Police traffic stop motivation information from 2012 to September 2018

DHL Facilities

Information about U.S. DHL facilities, including exact locations

MLS Players' Salaries

The Major League Soccer Players Union serves as the exclusive collective bargaining representative for all current players in Major League Soccer. Formed in April 2003, the Union ensures protection of the rights of al...

Tornadoes in the U.S., 1950-2015

Tornadoes tracked by NOAA from 1950-2015

A Billion Bits of the Center Column of the Rule 30 Cellular Automaton

The center column of the rule 30 cellular automaton over a billion steps of evolution

UFO Sightings 2015

Dataset of UFO sightings in the United States in 2015

Periodic Groundwater Level Measurements

Dataset of seasonal and long-term groundwater level measurements in groundwater basins in California

Peace Corps Volunteer Demographics (FY 2016)

Dataset of the demographics of Peace Corps volunteers in the 2016 fiscal year

UPS Facilities

This dataset represents UPS facilities

Wind Storms in the U.S., 1955-2015

Wind storms tracked by NOAA from 1955-2015

United States Supreme Court Decisions 1946-present

Datasets relating to Supreme Court cases from 1946 to present

The Silver Game of Life Lexicon

Famous Game of Life configurations collected by Stephen A. Silver

Urbana Police Arrests Since 1988

Arrests by the Urbana Police Department since 1988

Famous 2D Cellular Automata

A list of well-known 2D cellular automata, such as Conway’s Game of Life

US Federal Reserve Systems

Detailed location information for all U.S. Federal Reserve Banks and Branches

FedEx Facilities

Information about U.S. FedEx facilities, including exact locations

Supreme Court Justice Database

A database on individual U.S. Supreme Court Justices and various variables including personalities and service on the Court's bench.

Repurposing Therapeutics for COVID-19

Vina docking scores for drug molecules with the S-protein of SARS-CoV-2 and the human ACE2 receptor

Gridded World Population Density

UN-adjusted gridded world population density for the years 2000, 2005, 2010, and 2015

Hansen Solubility Parameters

Hansen solubility parameters for 211 common solvents

Path of the Total Solar Eclipse of August 21st, 2017

Dataset of the path of the total solar eclipse of August 21st, 2017

Solutions of the Loculus of Archimedes

The 536 distinct solutions for the Loculus of Archimedes

Cunningham Number Factorizations

Numbers of the form b^n-1 and b^n+1 are factored for small prime bases b={2,3,5,7}

Federal Lands of the United States

Lands owned or administered by the Federal government

CDC Primers for SARS-CoV-2 Research

Primers provided by the US Centers for Disease Control and Prevention (CDC) for identifying SARS-CoV-2 for research purposes, including the names, sequences, working concentration, and related information

Executions in the United States

A dataset about executions (the death penalty) in the United States since the 1976 Supreme Court decision in Gregg v. Georgia (428 U.S. 153)

Geotagged Public Tweets (Europe, April 6-8 2016)

Public Twitter statuses

Urbana Police Stop Sheets

Police Stop Sheet results from the City of Urbana after 2016

California Crop Mapping

Dataset of agricultural land use and irrigated acres in California

The Second Swift Burst Alert Telescope Gamma-Ray Burst Catalog

476 gamma-ray bursts detected by the Swift Burst Alert Telescope (BAT) between 2004 December 19 and 2009 December 21

Paul Revere's Social Network in Colonial Boston

Dataset of associations among political groups in colonial Boston, 1762-1775

Video Games Until April 2017

Dataset from the Internet Games Database API as of April 2017

New Orleans Slave Sales 1856-1861

Slave sales recorded by the New Orleans register of conveyance, October 1856 to August 1861

Global Immunization Coverage

Immunization coverage by country for 1980-2018

Total Wildland Fires and Acres (1926-2019)

Annual wildland fire statistics for federal and state agencies

COVID-19 Hospital Resource Use Projections

Projected hospital resource use based on COVID-19 deaths

Video Games

Dataset from the Internet Games Database API as of June 2018

The Big Mac Index

The Big Mac index, published by The Economist from 2000 to 2018.

Minecraft Block Types

Wolfram Language EntityStore with IDs and sample images for 150+ types of Minecraft blocks

Urbana Police Incidents

Urbana Police incidents since 1988

Coronavirus COVID-19 Pandemic Government Measures

Measures taken by governments of different countries to fight the coronavirus COVID-19 pandemic caused by SARS-CoV-2

Public Housing Developments 2015

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...

Community Development Block Grant Activity by Tract

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...

Low Income Housing Tax Credit Properties

HUD's PD&R (Office of Policy Development and Research) is responsible for maintaining current information on housing needs, market conditions, and existing programs, as well as conducting research on priority housing ...