Skip Navigation

US Producer Price Index - Commodities

labor statistics ppi commodity

The Producer Price Index (PPI) is a measure of average change over time in the selling prices received by domestic producers for their output. The prices included in the PPI are from the first commercial transaction for products and services covered.

About 10,000 PPIs for individual products and groups of products are released each month. PPIs are available for the output of nearly all industries in the goods-producing sectors of the U.S. economy–mining, manufacturing, agriculture, fishing, and forestry–as well as natural gas, electricity, construction, and goods competitive with those made in the producing sectors, such as waste and scrap materials. The PPI program covers approximately 72 percent of the service sector’s output, as measured by revenue reported in the 2007 Economic Census. Data includes industries in the following sectors: wholesale and retail trade; transportation and warehousing; information; finance and insurance; real estate brokering, rental, and leasing; professional, scientific, and technical services; administrative, support, and waste management services; health care and social assistance; and accommodation.

README containing file for detailed information about this dataset is available at original dataset location. Additional information is available in the FAQs.

This dataset is produced from the Producer Price Indexes data published by US Bureau of Labor Statistics (BLS). Review Linking and Copyright Information and Important Web Site Notices for the terms and conditions related to the use this dataset.

Storage Location

This dataset is stored in the East US Azure region. Allocating compute resources in East US is recommended for affinity.

Notices

MICROSOFT PROVIDES AZURE OPEN DATASETS ON AN “AS IS” BASIS. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, GUARANTEES OR CONDITIONS WITH RESPECT TO YOUR USE OF THE DATASETS. TO THE EXTENT PERMITTED UNDER YOUR LOCAL LAW, MICROSOFT DISCLAIMS ALL LIABILITY FOR ANY DAMAGES OR LOSSES, INCLUDING DIRECT, CONSEQUENTIAL, SPECIAL, INDIRECT, INCIDENTAL OR PUNITIVE, RESULTING FROM YOUR USE OF THE DATASETS.

This dataset is provided under the original terms that Microsoft received source data. The dataset may include data sourced from Microsoft.

Access

Available inWhen to use
Azure Notebooks

Quickly explore the dataset with Jupyter notebooks hosted on Azure or your local machine.

Azure Databricks

Use this when you need the scale of an Azure managed Spark cluster to process the dataset.

Azure Synapse

Use this when you need the scale of an Azure managed Spark cluster to process the dataset.

Preview

item_code group_code series_id year period value footnote_codes seasonal series_title group_name item_name
120922 05 WPU05120922 2008 M06 100 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M07 104.6 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M08 104.4 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M09 98.3 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M10 101.5 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M11 95.2 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2008 M12 96.7 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2009 M01 104.2 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2009 M02 113.2 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
120922 05 WPU05120922 2009 M03 121 nan U PPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjusted Fuels and related products and power Prepared bituminous coal underground mine, mechanically crushed/screened/sized only
Name Data type Unique Values (sample) Description
footnote_codes string 3 nan
P

Identifies footnotes for the data series. Most values are null. See https://download.bls.gov/pub/time.series/wp/wp.footnote.

group_code string 56 02
01

Code identifying major commodity group covered by the index. See https://download.bls.gov/pub/time.series/wp/wp.group for group codes and names.

group_name string 56 Processed foods and feeds
Farm products

Name of the major commodity group covered by the index. See https://download.bls.gov/pub/time.series/wp/wp.group for group codes and names.

item_code string 2,949 1
11

Identifies item for which data observations pertain. See https://download.bls.gov/pub/time.series/wp/wp.item for item codes and names.

item_name string 3,410 Warehousing, storage, and related services
Security guard services

Full names of items. See https://download.bls.gov/pub/time.series/wp/wp.item for item codes and names.

period string 13 M06
M07

Identifies period for which data is observed. See https://download.bls.gov/pub/time.series/wp/wp.period for list of period values.

seasonal string 2 U
S

Code identifying whether the data are seasonally adjusted. S=Seasonally Adjusted; U=Unadjusted

series_id string 5,458 WPU081
WPU051

Code identifying the specific series. A time series refers to a set of data observed over an extended period of time over consistent time intervals. See https://download.bls.gov/pub/time.series/wp/wp.series for details of series such as code, name, start and end year, etc.

series_title string 4,379 PPI Commodity data for Mining services, not seasonally adjusted
PPI Commodity data for Metal treatment services, not seasonally adjusted

Title of the specific series. A time series refers to a set of data observed over an extended period of time over consistent time intervals. See https://download.bls.gov/pub/time.series/wp/wp.series for details of series such as id, name, start and end year, etc.

value float 6,788 100.0
99.0999984741211

Price index for item.

year int 26 2018
2017

Identifies year of observation.

Select your preferred service:

Azure Notebooks

Azure Databricks

Azure Synapse

Azure Notebooks

Package: Language: Python Python
In [1]:
# This is a package in preview.
from azureml.opendatasets import UsLaborPPICommodity

labor = UsLaborPPICommodity()
labor_df = labor.to_pandas_dataframe()
ActivityStarted, to_pandas_dataframe
ActivityStarted, to_pandas_dataframe_in_worker
Looking for parquet files...
Reading them into Pandas dataframe...
Reading ppi_commodity/part-00000-tid-160579496407747812-077bf440-b39a-4520-9373-0a3f021dd59e-5654-1-c000.snappy.parquet under container laborstatisticscontainer
Done.
ActivityCompleted: Activity=to_pandas_dataframe_in_worker, HowEnded=Success, Duration=20409.23 [ms]
ActivityCompleted: Activity=to_pandas_dataframe, HowEnded=Success, Duration=20434.79 [ms]
In [2]:
labor_df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 6825676 entries, 0 to 6825675
Data columns (total 11 columns):
item_code         object
group_code        object
series_id         object
year              int32
period            object
value             float32
footnote_codes    object
seasonal          object
series_title      object
group_name        object
item_name         object
dtypes: float32(1), int32(1), object(9)
memory usage: 520.8+ MB
In [1]:
# Pip install packages
import os, sys

!{sys.executable} -m pip install azure-storage-blob
!{sys.executable} -m pip install pyarrow
!{sys.executable} -m pip install pandas
In [2]:
# Azure storage access info
azure_storage_account_name = "azureopendatastorage"
azure_storage_sas_token = r""
container_name = "laborstatisticscontainer"
folder_name = "ppi_commodity/"
In [3]:
from azure.storage.blob import BlockBlobServicefrom azure.storage.blob import BlobServiceClient, BlobClient, ContainerClient

if azure_storage_account_name is None or azure_storage_sas_token is None:
    raise Exception(
        "Provide your specific name and key for your Azure Storage account--see the Prerequisites section earlier.")

print('Looking for the first parquet under the folder ' +
      folder_name + ' in container "' + container_name + '"...')
container_url = f"https://{azure_storage_account_name}.blob.core.windows.net/"
blob_service_client = BlobServiceClient(
    container_url, azure_storage_sas_token if azure_storage_sas_token else None)

container_client = blob_service_client.get_container_client(container_name)
blobs = container_client.list_blobs(folder_name)
sorted_blobs = sorted(list(blobs), key=lambda e: e.name, reverse=True)
targetBlobName = ''
for blob in sorted_blobs:
    if blob.name.startswith(folder_name) and blob.name.endswith('.parquet'):
        targetBlobName = blob.name
        break

print('Target blob to download: ' + targetBlobName)
_, filename = os.path.split(targetBlobName)
blob_client = container_client.get_blob_client(targetBlobName)
with open(filename, 'wb') as local_file:
    blob_client.download_blob().download_to_stream(local_file)
In [4]:
# Read the parquet file into Pandas data frame
import pandas as pd

print('Reading the parquet file into Pandas data frame')
df = pd.read_parquet(filename)
In [5]:
# you can add your filter at below
print('Loaded as a Pandas data frame: ')
df
In [6]:
 

Azure Databricks

Package: Language: Python Python
In [1]:
# This is a package in preview.
from azureml.opendatasets import UsLaborPPICommodity

labor = UsLaborPPICommodity()
labor_df = labor.to_spark_dataframe()
ActivityStarted, to_spark_dataframe ActivityStarted, to_spark_dataframe_in_worker ActivityCompleted: Activity=to_spark_dataframe_in_worker, HowEnded=Success, Duration=2871.21 [ms] ActivityCompleted: Activity=to_spark_dataframe, HowEnded=Success, Duration=2875.06 [ms]
In [2]:
display(labor_df.limit(5))
item_codegroup_codeseries_idyearperiodvaluefootnote_codesseasonalseries_titlegroup_nameitem_name
12092205WPU05120922 2008M06100.0nanUPPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjustedFuels and related products and powerPrepared bituminous coal underground mine, mechanically crushed/screened/sized only
12092205WPU05120922 2008M07104.6nanUPPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjustedFuels and related products and powerPrepared bituminous coal underground mine, mechanically crushed/screened/sized only
12092205WPU05120922 2008M08104.4nanUPPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjustedFuels and related products and powerPrepared bituminous coal underground mine, mechanically crushed/screened/sized only
12092205WPU05120922 2008M0998.3nanUPPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjustedFuels and related products and powerPrepared bituminous coal underground mine, mechanically crushed/screened/sized only
12092205WPU05120922 2008M10101.5nanUPPI Commodity data for Fuels and related products and power-Prepared bituminous coal underground mine, mechanically crushed/screened/sized only, not seasonally adjustedFuels and related products and powerPrepared bituminous coal underground mine, mechanically crushed/screened/sized only
In [1]:
# Azure storage access info
blob_account_name = "azureopendatastorage"
blob_container_name = "laborstatisticscontainer"
blob_relative_path = "ppi_commodity/"
blob_sas_token = r""
In [2]:
# Allow SPARK to read from Blob remotely
wasbs_path = 'wasbs://%s@%s.blob.core.windows.net/%s' % (blob_container_name, blob_account_name, blob_relative_path)
spark.conf.set(
  'fs.azure.sas.%s.%s.blob.core.windows.net' % (blob_container_name, blob_account_name),
  blob_sas_token)
print('Remote blob path: ' + wasbs_path)
In [3]:
# SPARK read parquet, note that it won't load any data yet by now
df = spark.read.parquet(wasbs_path)
print('Register the DataFrame as a SQL temporary view: source')
df.createOrReplaceTempView('source')
In [4]:
# Display top 10 rows
print('Displaying top 10 rows: ')
display(spark.sql('SELECT * FROM source LIMIT 10'))

Azure Synapse

Package: Language: Python
In [1]:
# Azure storage access info
blob_account_name = "azureopendatastorage"
blob_container_name = "laborstatisticscontainer"
blob_relative_path = "ppi_commodity/"
blob_sas_token = r""
In [2]:
# Allow SPARK to read from Blob remotely
wasbs_path = 'wasbs://%s@%s.blob.core.windows.net/%s' % (blob_container_name, blob_account_name, blob_relative_path)
spark.conf.set(
  'fs.azure.sas.%s.%s.blob.core.windows.net' % (blob_container_name, blob_account_name),
  blob_sas_token)
print('Remote blob path: ' + wasbs_path)
In [3]:
# SPARK read parquet, note that it won't load any data yet by now
df = spark.read.parquet(wasbs_path)
print('Register the DataFrame as a SQL temporary view: source')
df.createOrReplaceTempView('source')
In [4]:
# Display top 10 rows
print('Displaying top 10 rows: ')
display(spark.sql('SELECT * FROM source LIMIT 10'))