Data Lake Analytics

An on-demand analytics job service to power intelligent action

The first cloud analytics service where you can easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python and .Net over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly and only pay per job.

Start in seconds, scale instantly and pay per job

Our on-demand service will have you processing big data jobs in seconds. There is no infrastructure to worry about because there are no servers, virtual machines or clusters to wait for, manage or tune. You can instantly scale the analytic units (processing power) from one to thousands for each job. You only pay for the processing used per job.

Develop massively parallel programs with simplicity

U-SQL is a simple, expressive, and extensible language that allows you to write code once and have it automatically parallelized for the scale you need. Process petabytes of data for diverse workload categories such as querying, ETL, analytics, machine learning, machine translation, image processing, and sentiment analysis by leveraging existing libraries written in .NET languages, R, or Python. Watch the U-SQL query execution for Azure Data Lake video to see how we detect the type of objects in one million images using a U-SQL built-in cognitive library.

Debug and optimise your big data programs with ease

Debugging failures in cloud-distributed programs is now as easy as debugging a program in your personal environment. Our execution environment actively analyses your programs as they run and offers recommendations to improve performance and reduce cost. For example, if you requested 1,000 AUs for your program and only 50 AUs were needed, the system would recommend that you only use 50 AUs, resulting in cost savings that are 20 times higher.

Virtualise your analytics

The power to act on all of your data, with optimised data virtualisation of your relational sources such as Azure SQL Database and Azure SQL Data Warehouse. Queries are automatically optimised by moving processing close to the source data without data movement, thereby maximising performance and minimising latency.

Enterprise-grade security, auditing and support

Extend your on-premises security and governance controls to the cloud for meeting your security and regulatory compliance needs. Capabilities such as single sign-on (SSO), multi-factor authentication and seamless management of millions of identities are built in with Azure Active Directory. Role-based access control and the ability to audit all processing and management operations are on by default. We guarantee a 99.9% enterprise-grade SLA and 24/7 support for your big data solution.

Related products and services

Data Lake Store

Hyper-scale repository for big data analytics workloads

HDInsight

Provision cloud Hadoop, Spark, R Server, HBase and Storm clusters

Get started with Data Lake Analytics