Azure Data Lake Storage

Massively scalable, secure data lake functionality built on Azure Blob Storage

Get powerful data lake functionality at cloud scale

Azure Data Lake Storage Gen2 is a highly scalable and cost-effective data lake solution for big data analytics. It combines the power of a high-performance file system with massive scale and economy to help you speed your time to insight. Data Lake Storage Gen2 extends Azure Blob Storage capabilities and is optimized for analytics workloads. Data Lake Storage Gen2 is the most comprehensive data lake available.

Read the blog

See more videos


Test models faster with a Hadoop-compatible file system that supports atomic file and folder operations and is optimized to execute jobs at lightning speed.


Extend the global scale, durability, and performance of Azure Blob Storage, and get support for massive storage accounts.


Meet the most stringent enterprise data security requirements with tools and resources like POSIX-compliant, fine-grained ACL support, object store security with at-rest encryption, Azure Active Directory integration, and storage account firewalls.


Get data lake functionality at cloud object store pricing levels. Data Lake Storage Gen2 provides the same lifecycle policy management and object-level tiering that’s built into Blob Storage.

Service capabilities

Massive scalability

Near limitless storage for analytics data

Cloud object store pricing

Same low-cost data storage model as Azure Blob Storage

Fewer file and folder transactions

Atomic transactions for fewer compute cycles and faster job execution

Granular file and folder security

POSIX-compliant, fine-grained access control lists (ACLs)

Simplified ingestion in a single store

Consolidated data storage using the Data Lake Storage Gen2 or Blob Storage REST API

Full Azure Blob Storage feature set

Data lifecycle policy management; hot, cool, and archive tiers; and high availability/disaster recovery support

Role-based access and storage account firewalls

Multi-layer security to govern data access so only users from authorized IPs can perform analytics

Common data model (CDM) support

Ability to exchange data with powerful applications like Microsoft Dynamics 365 (for CRM) and Power BI

Trusted partners

  • Informatica
  • Attunity
  • WANDisco
  • Striim
  • Qubole
  • Cloudera

Related products and services

Azure Databricks

Fast, easy, and collaborative Apache Spark-based analytics platform

Data Factory

Hybrid data integration at enterprise scale, made easy

SQL Data Warehouse

Elastic data warehouse as a service with enterprise-class features

Get started with Azure Data Lake Storage