Azure Data Lake Storage Gen2 pricing

Massively scalable, secure data lake storage built on Blob Storage

Azure Data Lake Storage Gen2 is the world’s most productive Data Lake. It combines the power of a Hadoop compatible file system with integrated hierarchical namespace with the massive scale and economy of Azure Blob Storage to help speed your transition from proof of concept to production.

  1. Is fully integrated with the Azure Blob Storage platform; Every Blob Storage capability (Azure Active Directory Integration, At Rest Encryption, High-availability and disaster recovery, Automated Lifecycle Policy Management, Storage Account Firewalls, etc.) can be used by Azure Data Lake Storage
  2. Is optimized for leading cloud analytic engines
  3. Is tightly integrated with all elements of the Azure Big Data Analytics stack to deliver fast insights
  4. Allows analytics data to coexist with object data in the same store with no programming changes or data copying
  5. Performs faster than other data stores thus making your analytics workloads run faster and lowering your TCO

General Purpose v2 provides access to the latest Azure storage features, including Cool and Archive storage, with pricing optimized for the lowest GB storage prices. These accounts provide access to Data Lake Storage, Block Blobs, Page Blobs, Files, and Queues.

Azure Data Lake Storage Gen2 provides the choice of organizing data in two different ways. With hierarchical namespaces option, customers can organize their Data Lake into structured directories, folders and files. With flat namespaces, customers can operate their Data Lake as an unstructured blob store. Regardless of the two options, customers will pay for the same storage price as per the table below. However, with the hierarchical namespace option, customers will be charged for additional meta-data associated with the folder and directory structure as part of the bill.

Azure Data Lake Storage Gen2 in now Generally Available and the prices below reflects GA prices in all regions except UAE. Until April 1, 2019, the Azure Data Lake Storage Gen2 will be charged at 50% discount.

Data storage prices

Hot Cool Archive
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB N/A
Over 450 TB / Month $- per GB $- per GB N/A
Over 500 TB / Month $- per GB $- per GB N/A
Hot Cool Archive
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB $- per GB
Over 450 TB / Month $- per GB $- per GB $- per GB
Over 500 TB / Month $- per GB $- per GB $- per GB
First 50 terabyte (TB) / month $- per GB $- per GB N/A
Over 450 TB / Month $- per GB $- per GB N/A
Over 500 TB / Month $- per GB $- per GB N/A

Transaction prices

Hot Cool Archive
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- N/A
Read Operations** (every 4MB, per 10,000) $- $- N/A
*The following API calls are considered write operations: AppendFile, CreateFilesystem, CreatePath, CreatePathFile, FlushFile, SetFileProperties, SetFilesystemProperties, RenameFile, RenamePathFile, CopyFile **The following API calls are considered read operations: ReadFile, ListFilesystemFile
Hot Cool Archive
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- $-
Read Operations** (every 4MB, per 10,000) $- $- $-
Write Operations* (every 4MB, per 10,000) $- $- N/A
Read Operations** (every 4MB, per 10,000) $- $- N/A
*The following API calls are considered write operations: AppendFile, CreateFilesystem, CreatePath, CreatePathFile, FlushFile, SetFileProperties, SetFilesystemProperties, RenameFile, RenamePathFile, CopyFile **The following API calls are considered read operations: ReadFile, ListFilesystemFile

Archive early deletion

In addition to the per-GB, per-month charge, any blob that is moved to Archive is subject to an Archive early deletion period of 180 days. This charge is prorated. For example, if a blob is moved to Archive and then deleted or moved to the Hot tier after 45 days, the customer is charged an early deletion fee equivalent to 135 (180 minus 45) days of storing that blob in Archive.

Other Operations and Meta data Storage meters

Hot Cool Archive
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) N/A N/A N/A
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) N/A N/A N/A
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) N/A N/A N/A
List and Create Container Operations (per 10,000) $- $- N/A
Iterative Write Operations (100’s)* $- $- N/A
All other Operations (per 10,000), except Delete, which is free $- $- N/A
Data Retrieval (per GB) Free $- N/A
Data Write (per GB) Free Free N/A
Meta-data storage (GB / Month) N/A N/A N/A
*The following API calls are considered iterative write operations: RenameDirectory,RenamePath,RenamePathDir
Hot Cool Archive
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) $- N/A N/A
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) $- N/A N/A
List and Create Container Operations (per 10,000) $- $- $-
Iterative Write Operations (100’s)* $- $- $-
All other Operations (per 10,000), except Delete, which is free $- $- $-
Data Retrieval (per GB) Free $- $-
Data Write (per GB) Free Free Free
Meta-data storage (GB / Month) $- N/A N/A
List and Create Container Operations (per 10,000) $- $- N/A
Iterative Write Operations (100’s)* $- $- N/A
All other Operations (per 10,000), except Delete, which is free $- $- N/A
Data Retrieval (per GB) Free $- N/A
Data Write (per GB) Free Free N/A
Meta-data storage (GB / Month) $- N/A N/A
*The following API calls are considered iterative write operations: RenameDirectory,RenamePath,RenamePathDir

Data transfer prices for ADLS Gen2

When you write data into GRS accounts, that data will be replicated to another Azure region. The Geo-Replication Data Transfer charge is for the bandwidth of replicating that data to another Azure region. This charge also applies when you change the account replication setting from LRS to GRS or RA-GRS. View the Data transfer prices on Blobs pricing page.

FAQ

  • Azure Data Lake Storage is optimized for running analytic workloads on unstructured data. Azure Data Lake Storage Gen2 is optimized for fast I/O of high volume data, thereby making analytic workloads run faster and lowering the TCO for analytic jobs. Further, Azure Data Lake Storage Gen2 provides the added flexibility of organizing data either in a flat or hierarchical namespace.

  • With hierarchical namespaces, you can organize data into structured folders and directories. With a flat namespace, your files will be organized in a flat structure just like Blob Storage. A hierarchical namespace allows operations like folder renames and deletes to be performed in a single atomic operation, which with a flat namespace requires a number of operations proportionate to the number of objects in the structure. Hierarchical namespaces store additional meta-data for your directory and folder structure. However, as your data volume grows, hierarchical namespaces keeps your data organized and more importantly yields better storage performance on your analytic jobs – thus lowering your overall TCO to run analytic jobs.

    • Billing using flat namespaces

    Let’s say you store 120TB of data for the whole month in Azure Data Lake Storage Gen2 using flat namespaces in the “Hot Tier”. During this month, you perform 100 million operations, and let’s assume each operation is 6MB in size. Further, let’s say that 20% of these operations are write operations and the other 80% are read operations. Finally, let’s assume you also rename 10K directories during the month.

    For flat namespaces, you will not incur additional charges for meta-data related to your files and folders and therefore will be charged for 120TB of data. Also, since every operation is 6MB, you will be charged two transactions per operation (4MB + 2MB), since every 4MB of data read or written is charged as a transaction. Finally, directory renames are charged using a separate meter.

    This is how your total cost will be calculated:

    Resource Used Usage Volume Price Monthly Cost
    Storage 120TB

    $- for first 50TB

    $- for next 450TB

    $- * 50TB = $-

    $- * 70TB = $-

    Total $-
    Transactions Writes: Every 4MB = 2

    $- per 10K

    2 * $- per 10K * 20M = $-

    Reads: Every 4MB = 2

    $- per 10K

    2 * $- per 10K * 80M = $-

    Total $-
    Directory Renames 10K $- per 100 $- per 100 * 10K = $-
    Total Monthly Cost:

    Storage

    Transactions

    Directory Renames

    Total Monthly Cost

    $-

    $-

    $-

    $-


    • Billing using hierarchical namespaces

    Let’s say you store 120TB of data for the whole month in Azure Data Lake Storage Gen2 using hierarchical namespaces in the “Hot Tier”. During this month, you perform 100 million operations, and let’s assume each operation is 6MB in size. Further, let’s say that 20% of these operations are write operations and the other 80% are read operations. Finally, let’s assume you also rename 10K directories during the month.

    For hierarchical namespaces, you will incur additional charges for meta-data related to your files and folders and therefore will be charged for a little more 120TB of data. Also, since every operation is 6MB, you will be charged two transactions per operation (4MB + 2MB), since every 4MB of data read or written is charged as a transaction. Finally, directory renames are charged using a separate meter.

    This is how your total cost will be calculated:

    Resource Used Usage Volume Price Monthly Cost
    Storage 132TB

    $- for first 50TB

    $- for next 450TB

    $- * 50TB = $-

    $- * 82TB = $-

    Total $-
    Transactions Writes: Every 4MB = 2

    $- per 10K

    2 * $- per 10K * 20M = $-

    Reads: Every 4MB = 2

    $- per 10K

    2 * $- per 10K * 80M = $-

    Total $-
    Directory Renames 10K $- per 100 $- per 100 * 10K = $-
    Total Monthly Cost:

    Storage

    Transactions

    Directory Renames

    Total Monthly Cost

    $-

    $-

    $-

    $-

  • Yes, larger files are more cost effective and yield better analytic performance. For files > 4MB in size, Azure Data Lake Storage Gen2 offers lower price for every 4MB block of data read beyond the first 4MB. To read a single file that is 16 MB is cheaper than reading 4 files that are 4MB each. In both cases, the total number of transactions are 4. However, the last 12 MB, read as three transactions, from the 16MB file are cheaper, there by making the total cost of reading a 16MB file cheaper than reading 4 files that are 4MB each.

    More importantly, Azure Data Lake Storage Gen2 is highly optimized to perform faster on larger files. This means that your analytics jobs will run faster, when operating on larger files, thus further your TCO for running analytics jobs.

Support & SLA

  • Free billing and subscription management support.
  • Flexible support plans starting at $29/month. Shop for a plan.
  • Guaranteed 99.9 percent or greater availability (excludes preview services). Read the SLA.

Resources

Estimate your monthly costs for Azure services

Review Azure pricing frequently asked questions

Learn more about Storage Accounts

Review technical tutorials, videos, and more resources

Added to estimate. Press 'v' to view on calculator

Learn and build with $200 in credit, and keep going for free