Azure Data Lake Storage

Massively scalable, secure data lake functionality built on Azure Blob Storage

Get powerful data lake functionality at cloud scale

Azure Data Lake Storage Gen2 is a highly scalable and cost-effective data lake solution for big data analytics. It combines the power of a high-performance file system with massive scale and economy to help you speed your time to insight. Data Lake Storage Gen2 extends Azure Blob Storage capabilities and is optimised for analytics workloads. Data Lake Storage Gen2 is the most comprehensive data lake available.

Read the blog

See more videos

Fast

Test models faster with a Hadoop-compatible file system that supports atomic file and folder operations and is optimised to execute jobs at lightning speed.

Scalable

Extend the global scale, durability and performance of Azure Blob Storage, and get support for massive storage accounts.

Secure

Meet the most stringent enterprise data security requirements with tools and resources such as POSIX-compliant, fine-grained ACL support, object store security with at-rest encryption, Azure Active Directory integration and storage account firewalls.

Cost-effective

Get data lake functionality at cloud object store pricing levels. Data Lake Storage Gen2 provides the same lifecycle policy management and object-level tiering that’s built into Blob Storage.

Service capabilities

Massive scalability

Near limitless storage for analytics data

Cloud object store pricing

Same low-cost data storage model as Azure Blob Storage

Fewer file and folder transactions

Atomic transactions for fewer compute cycles and faster job execution

Granular file and folder security

POSIX-compliant, fine-grained access control lists (ACLs)

Simplified ingestion in a single store

Consolidated data storage using the Data Lake Storage Gen2 or Blob Storage REST API

Full Azure Blob Storage feature set

Data lifecycle policy management; hot, cool and archive tiers; and high availability/disaster recovery support

Role-based access and storage account firewalls

Multi-layer security to govern data access so only users from authorised IPs can perform analytics

Common data model (CDM) support

Ability to exchange data with powerful applications like Microsoft Dynamics 365 (for CRM) and Power BI

Trusted partners

  • Informatica Cloud
  • Attunity
  • WANDisco
  • Striim
  • Qubole
  • Cloudera

Related products and services

Azure Databricks

Fast, easy and collaborative Apache Spark-based analytics platform

Data Factory

Hybrid data integration at enterprise scale, made easy

Azure Synapse Analytics

Limitless analytics service with unmatched time to insight

Get started with Azure Data Lake Storage