Data Lake Analytics

An on-demand analytics job service to power intelligent action

Easily develop and run massively parallel data transformation and processing programmes in U-SQL, R, Python and .NET over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly and only pay per job.

Start in seconds, scale instantly, pay per job

Process big data jobs in seconds with Azure Data Lake Analytics. There is no infrastructure to worry about because there are no servers, virtual machines or clusters to wait for, manage or tune. Instantly scale the processing power, measured in Azure Data Lake Analytics Units (AU), from one to thousands for each job. You only pay for the processing which you use per job.

Develop massively parallel programs with simplicity

U-SQL is a simple, expressive and extensible language which allows you to write code once and have it automatically parallelised for the scale you need. Process petabytes of data for diverse workload categories such as querying, ETL, analytics, machine learning, machine translation, image processing and sentiment analysis by leveraging existing libraries written in .NET languages, R or Python. Watch the U-SQL query execution for Azure Data Lake video to see how we detect the type of objects in one million images using a U-SQL built-in cognitive library.

Debug and optimise your big data programs with ease

Debug failures in cloud distributed programmes as easily as debugging a programme in your personal environment. Our execution environment actively analyses your programmes as they run and gives you recommendations to improve performance and reduce cost. For example, if you request 1000 AUs for your programme and only 50 AUs are needed, the system recommends that you only use 50 AUs—reducing the cost by 95%.

Virtualise your analytics

Act on all of your data with optimised data virtualisation of your relational sources such as Azure SQL Database and Azure SQL Data Warehouse. Your queries are automatically optimised by moving processing close to the source data without data movement, which maximises performance and minimises latency.

Enterprise-grade security, auditing and support

Extend your on-premises security and governance controls to the cloud and meet your security and regulatory compliance needs. Single sign-on (SSO), multi-factor authentication and seamless management of millions of identities are built-in through Azure Active Directory. Role-based access control and the ability to audit all processing and management operations are on by default. We guarantee a 99.9% enterprise-grade SLA and 24/7 support for your big data solution.

Related products and services

Data Lake Store

Hyperscale repository for big data analytics workloads

HDInsight

Provision cloud Hadoop, Spark, R Server, HBase, and Storm clusters

Try Data Lake Analytics for free