Data Lake Analytics

An on-demand analytics job service to power intelligent action

Easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python, and .NET over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly, and only pay per job.

Start in seconds, scale instantly, pay per job

Process big data jobs in seconds with Azure Data Lake Analytics. There is no infrastructure to worry about because there are no servers, virtual machines, or clusters to wait for, manage, or tune. Instantly scale the processing power, measured in Azure Data Lake Analytics Units (AU), from one to thousands for each job. You only pay for the processing that you use per job.

Develop massively parallel programs with simplicity

U-SQL is a simple, expressive, and extensible language that allows you to write code once and have it automatically parallelized for the scale you need. Process petabytes of data for diverse workload categories such as querying, ETL, analytics, machine learning, machine translation, image processing, and sentiment analysis by leveraging existing libraries written in .NET languages, R, or Python. Watch the U-SQL query execution for Azure Data Lake video to see how we detect the type of objects in one million images using a U-SQL built-in cognitive library.

Debug and optimize your big data programs with ease

Debug failures in cloud distributed programs as easily as debugging a program in your personal environment. Our execution environment actively analyzes your programs as they run and gives you recommendations to improve performance and reduce cost. For example, if you request 1000 AUs for your program and only 50 AUs are needed, the system recommends that you only use 50 AUs—reducing the cost by 95%.

Virtualize your analytics

Act on all of your data with optimized data virtualization of your relational sources such as Azure SQL Database and Azure SQL Data Warehouse. Your queries are automatically optimized by moving processing close to the source data without data movement, which maximizes performance and minimizes latency.

Enterprise-grade security, auditing, and support

Extend your on-premises security and governance controls to the cloud, and meet your security and regulatory compliance needs. Single sign-on (SSO), multi-factor authentication, and seamless management of millions of identities are built-in through Azure Active Directory. Role-based access control and the ability to audit all processing and management operations are on by default. We guarantee a 99.9% enterprise-grade SLA and 24/7 support for your big data solution.

Related products and services

Data Lake Store

Hyperscale repository for big data analytics workloads

HDInsight

Provision cloud Hadoop, Spark, R Server, HBase, and Storm clusters

Try Data Lake Analytics for free