Skip Navigation

Curated open data made easily accessible on Azure

US Producer Price Index - Commodities

The Producer Price Index (PPI) is a measure of average change over time in the selling prices received by domestic producers for their output. The prices included in the PPI are from the first commercial transaction for products and services covered.

NOAA NEXRAD Level II

This dataset contains both current and archival level II data from the NEXRAD system.

Boston Safety Data

311 calls reported to the city of Boston.

MODIS

Satellite imagery from the Moderate Resolution Imaging Spectroradiometer (MODIS).

US Population by ZIP Code

US population by gender and race for each US ZIP code sourced from 2010 Decennial Census.

US Local Area Unemployment Statistics

The Local Area Unemployment Statistics (LAUS) program produces monthly and annual employment, unemployment, and labor force data for Census regions and divisions, States, counties, metropolitan areas, and many cities in the United States.

New York City Safety Data

All New York City 311 service requests from 2010 to the present.

Chicago Safety Data

311 service requests from the city of Chicago, including historical sanitation code complaints, pot holes reported, and street light issues

San Francisco Safety Data

Fire department calls for service and 311 cases in San Francisco.

US Consumer Price Index

The Consumer Price Index (CPI) is a measure of the average change over time in the prices paid by urban consumers for a market basket of consumer goods and services.

NAIP

This dataset contains aerial imagery from the National Agricultural Imagery Program (NAIP).

NOAA Global Forecast System (GFS)

15-day US hourly weather forecast data (example: temperature, precipitation, wind) produced by the Global Forecast System (GFS) from the National Oceanic and Atmospheric Administration (NOAA).

Public Holidays

Worldwide public holiday data sourced from PyPI holidays package and Wikipedia, covering 38 countries or regions from 1970 to 2099.

The MNIST database of handwritten digits

The MNIST database of handwritten digits has a training set of 60,000 examples and a test set of 10,000 examples. The digits have been size-normalized and centered in a fixed-size image.

US State Employment Hours and Earnings

The Current Employment Statistics (CES) program produces detailed industry estimates of nonfarm employment, hours, and earnings of workers on payrolls in the United States.

Seattle Safety Data

Seattle Fire Department 911 dispatches.

Harmonized Landsat Sentinel-2

Satellite imagery from the Landsat-8 and Sentinel-2 satellites for North America.

NYC Taxi & Limousine Commission - green taxi trip records

The green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

Daymet

Daymet provides gridded estimates of daily weather parameters (minimum temperature, maximum temperature, precipitation, shortwave radiation, vapor pressure, snow water equivalent, and day length) in North America from daily meteorological observations.

NYC Taxi & Limousine Commission - For-Hire Vehicle (FHV) trip records

The For-Hire Vehicle (“FHV”) trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID (shape file below). These records are generated from the FHV Trip Record submissions made by bases.

US National Employment Hours and Earnings

The Current Employment Statistics (CES) program produces detailed industry estimates of nonfarm employment, hours, and earnings of workers on payrolls in the United States.

Sample: Diabetes

The Diabetes dataset has 442 samples with 10 features, making it ideal for getting started with machine learning algorithms. Its one of the popular

US Producer Price Index - Industry

The Producer Price Index (PPI) is a measure of average change over time in the selling prices received by domestic producers for their output. The prices included in the PPI are from the first commercial transaction for products and services covered.

US Population by County

US population by gender and race for each US county sourced from 2000 and 2010 Decennial Census.

NYC Taxi & Limousine Commission - yellow taxi trip records

The yellow taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

US Labor Force Statistics

Labor Force Statistics labor force, labor force participation rates, and the civilian noninstitutional population by age, gender, race, and ethnic groups. in the United States.

NOAA Integrated Surface Data (ISD)

Worldwide hourly weather history data (example: temperature, precipitation, wind) sourced from the National Oceanic and Atmospheric Administration (NOAA).

Can't find the data? Email us to request a dataset or contribute a dataset