{"id":4326,"date":"2016-11-16T00:00:00","date_gmt":"2016-11-16T00:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/blog\/the-intelligent-data-lake"},"modified":"2025-06-17T09:50:10","modified_gmt":"2025-06-17T16:50:10","slug":"the-intelligent-data-lake","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/","title":{"rendered":"The Intelligent Data Lake"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\" id=\"advanced-analytics-and-cognitive-intelligence-on-petabyte-sized-files-and-trillions-of-objects-with-azure-data-lake\">Advanced Analytics and Cognitive Intelligence on Petabyte sized files and trillions of objects with Azure Data Lake<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Today we are announcing the general availability of Azure Data Lake, ushering in a new era of productivity for your big data developers and scientists. Fundamentally different from today\u2019s cluster-based solutions, the Azure Data Lake services enable you to securely store all your data centrally in a \u201cno limits\u201d data lake, and run on-demand analytics that instantly scales to your needs. Our state-of-the-art development environment and rich and extensible U-SQL language enable you to write, debug, and optimize massively parallel analytics programs in a fraction of the time of existing solutions.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"before-azure-data-lake\">Before Azure Data Lake<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Traditional approaches for big data analytics constrain the productivity of your data developers and scientists due to time spent on infrastructure planning, and writing, debugging, &amp; optimizing code with primitive tooling. They also lack rich built-in cognitive capabilities like keyphrase extraction, sentiment analysis, image tagging, OCR, face detection, and emotion analysis. 
The underlying storage systems also impose challenges with artificial limits on file and account sizes, requiring you to build workarounds. Additionally, your developers\u2019 valuable time is spent optimizing the system, or you end up overpaying for unused cluster capacity. The friction in these existing systems is so high that it effectively prevents companies from realizing the business transformation that Big Data promises.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"with-azure-data-lake\">With Azure Data Lake<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">With thousands of customers, Azure Data Lake has become one of the fastest growing Azure services. You can get started on this new era of big data productivity and scalability with the <strong>general availability<\/strong> of <strong>Azure Data Lake Analytics<\/strong> and <strong>Azure Data Lake Store<\/strong>.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Azure Data Lake Store<\/strong> \u2013 the first cloud Data Lake for enterprises that is secure, massively scalable and built to the open HDFS standard.\u00a0 With no limits to the size of data and the ability to run massively parallel analytics, you can now unlock value from all your unstructured, semi-structured and structured data.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><strong>Azure Data Lake Analytics<\/strong> \u2013 the first cloud analytics job service where you can easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python and .NET over petabytes of data. It has rich built-in cognitive capabilities such as image tagging, emotion detection, face detection, deriving meaning from text, and sentiment analysis with the ability to extend to any type of analytics. 
With Azure Data Lake Analytics, there is no infrastructure to manage, and you can process data on demand, scale instantly, and only pay per job.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><strong>Azure HDInsight<\/strong> \u2013 the only fully managed cloud Hadoop offering that provides optimized open source analytic clusters for Spark, Hive, Map Reduce, HBase, Storm, Kafka and R-Server backed by a 99.9% SLA. Today, we are announcing the general availability of R Server for HDInsight to do advanced analytics and predictive modelling with R+Spark. Further, we are introducing the public preview of Kafka for HDInsight, now the first managed cluster solution in the cloud for real-time ingestion with Kafka.<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-image aligncenter has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\" alt=\"Azure Data Lake_2\" style=\"border-radius:0px\" title=\"Azure Data Lake_2\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"data-lake-store-a-no-limits-data-lake-that-powers-big-data-analytics\">Data Lake Store &#8211; A No Limits Data Lake that powers Big Data Analytics<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"petabyte-sized-files-and-trillions-of-objects\">Petabyte sized files and trillions of objects<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">With Azure Data Lake Store your organization can now securely capture and analyze all data in a central location with no artificial constraints to limit the growth of your business. It can manage trillions of files where a single file can be greater than a petabyte in size &#8211; this is <strong>200x larger<\/strong> file size than other cloud object stores. 
Without the limits that constrain other cloud offerings, Data Lake Store is ideal for managing any type of data; including massive datasets like high-resolution video, genomic and seismic datasets, medical data, and data from a wide variety of industries. Data Lake Store is an enterprise data lake that can power your analytics today and in the future.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-content-justification-left is-nowrap is-layout-flex wp-container-core-group-is-layout-b9305f23 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/2e09ca69-dda7-4cac-ae62-9de66eed1cf5.webp\" alt=\"icon\" class=\"wp-image-10064 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/2e09ca69-dda7-4cac-ae62-9de66eed1cf5.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>DS-IQ provides Dynamic Shopper Intelligence by curating data from large amounts of non-relational sources like weather, health, traffic, and economic trends so that we can give our customers actionable insights to drive the most effective marketing and service communications. Azure Data Lake was perfect for us because it could scale elastically, on-demand, to petabytes of data within minutes. 
This scalability and performance has impressed us, giving us confidence that it can handle the amounts of data we need to process today and, in the future, enable us to provide even more valuable, dynamic, context-aware experiences for our clients.\u201d<\/em><\/p>\n<cite>-William Wu, Chief Technology Officer at DS-IQ<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"scalability-for-massively-parallel-analytics\">Scalability for massively parallel analytics<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Data Lake Store provides massive throughput to run analytic jobs with thousands of concurrent executors that read and write hundreds of terabytes of data efficiently. You are no longer forced to redesign your application or repartition your data because Data Lake Store scales throughput to support any size of workload. Multiple services like Data Lake Analytics, HDInsight or HDFS compliant applications can efficiently analyze the same data simultaneously.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/ad4c11ff-9c19-464f-ae65-c87f9f755908.webp\" alt=\"ecolab icon\" class=\"wp-image-10066 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/ad4c11ff-9c19-464f-ae65-c87f9f755908.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Ecolab has an ambitious mission to find solutions to some of the world\u2019s biggest challenges \u2013 clean water, safe food, abundant energy and healthy environments.\u00a0 \u201cAzure Data Lake has been deployed to our water division where we are collecting 
real-time data from IoT devices so we can help our customers understand how they can reduce, reuse, and recycle water and at the same time address one of the world\u2019s most pressing sustainability issues. We\u2019ve been impressed with Azure Data Lake because it allows us to store any amount of data we require and also lets us use our existing skills to analyze the data. Today, we have both groups who use open source technologies such as Spark in HDInsight to do analytics and other groups that use U-SQL, leveraging the extensibility of C# with the simplicity of SQL.\u201d<\/em><\/p>\n<cite>-Kevin Doyle, VP of IT, Global Industrial Solutions at Ecolab<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"data-lake-analytics-a-on-demand-analytics-job-service-to-power-intelligent-action\">Data Lake Analytics &#8211; An On-Demand Analytics Job Service to power intelligent action<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"start-in-seconds-scale-instantly-pay-per-job\">Start in seconds, scale instantly, pay per job<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Our on-demand service will have your data developers and scientists processing big data jobs to power intelligent action in seconds. There is no infrastructure to worry about because there are no servers, VMs, or clusters to wait for, manage or tune. Instantly apply or adjust the analytic units (processing power) from one to hundreds or even thousands for each job. 
Pay only for the processing used per job, freeing valuable developer time from the capacity planning and optimization that cluster-based systems require, which can take weeks to months.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/320258ea-17cc-499b-997e-e8516f8bf8c1.webp\" alt=\"DeviceDesk icon\" class=\"wp-image-10068 webp-format\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/320258ea-17cc-499b-997e-e8516f8bf8c1.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Azure Data Lake is instrumental because it helps Insightcentr ingest IoT-scale telemetry from PCs in real-time and gives detailed analytics to our customers without us spending millions of dollars building out big data clusters from scratch. We saw Azure Data Lake as the fastest and most scalable way we can bring our customers these valuable insights to their business.\u201d<\/em><\/p>\n<cite>-Anthony Stevens, CEO of Australian start-up Devicedesk<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"develop-massively-parallel-analytic-programs-with-simplicity\">Develop massively parallel analytic programs with simplicity<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">U-SQL is an easy-to-use, highly expressive, and extensible language that allows you to write code once and have it parallelized automatically for the scale you need. 
Instead of writing low-level code dealing with clusters, nodes, mappers, and reducers, etc., a developer writes a simple logical description of how data should be transformed for their business using both declarative and imperative techniques as desired. The U-SQL data processing system automatically parallelizes the code \u2013 enabling developers to control the amount of resources devoted to parallel computation with the simplicity of a slider. The U-SQL language is highly extensible and can reuse existing libraries written in a variety of languages like .NET languages, R, or Python. You can massively parallelize the code to process petabytes of data for diverse workload categories such as ETL, machine learning, feature engineering, image tagging, emotion detection, face detection, deriving meaning from text, and sentiment analysis.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/9c20ae55-bb27-4010-addb-a3b5899e257d.webp\" alt=\"PureCars icon\" class=\"wp-image-10070 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/9c20ae55-bb27-4010-addb-a3b5899e257d.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Azure Data Lake allows us to develop quickly on large sets of data with our current developer expertise. 
We have been able to leverage Azure Data Lake Analytics to capture and process large marketing audiences for our dynamic marketing platforms.&#8221;<\/em><\/p>\n<cite>-McPherson White, Director of Development at PureCars<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"run-big-cognition-at-petabyte-scale\">Run Big Cognition at Petabyte Scale<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Furthermore, we\u2019ve incorporated the technology that sits behind the <a href=\"https:\/\/www.microsoft.com\/cognitive-services\/en-us\/apis\" target=\"_blank\" rel=\"noopener\">Cognitive Services API<\/a> inside U-SQL directly. Now you can process any amount of unstructured data, e.g., text, images, and extract emotions, age, and all sorts of other cognitive features using Azure Data Lake and perform query by content. You can join emotions from image content with any other type of data you have and do incredibly powerful analytics and intelligence over it. This is what we call <strong><em>\u2018Big Cognition\u2019<\/em><\/strong>. It\u2019s not just extracting one piece of cognitive information at a time, not just about understanding an emotion or whether there\u2019s an object in an image, but rather it\u2019s about joining all the extracted cognitive data with other types of data, so you can do some really powerful analytics with it. We have demonstrated this capability at <a href=\"https:\/\/channel9.msdn.com\/Events\/Machine-Learning-and-Data-Sciences-Conference\/Data-Science-Summit-2016\" target=\"_blank\" rel=\"noopener\">Microsoft Ignite<\/a> and PASS Summit, by showing a Big Cognition demo in which we used U-SQL inside Azure Data Lake Analytics to <a href=\"https:\/\/blogs.msdn.microsoft.com\/azuredatalake\/2016\/08\/18\/introducing-image-processing-in-u-sql\/\" target=\"_blank\" rel=\"noopener\">process a million images<\/a> and understand what\u2019s inside those images. 
You can watch this demo <a href=\"https:\/\/www.youtube.com\/watch?v=UUulQYalpxU&amp;feature=youtu.be&amp;t=2268\" target=\"_blank\" rel=\"noopener\">here<\/a> and try it yourself using a <a href=\"https:\/\/github.com\/Azure\/usql\/tree\/master\/Examples\/ImageApp\" target=\"_blank\" rel=\"noopener\">sample project on GitHub<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"debug-and-optimize-your-big-data-programs-with-ease\">Debug and optimize your Big Data programs with ease<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">With today\u2019s tools, developers face serious challenges debugging distributed programs. Azure Data Lake makes debugging failures in cloud distributed programs as easy as debugging a program in your personal environment using the powerful tools within Visual Studio. Developers no longer need to inspect thousands of logs on each machine searching for failures. When a U-SQL job fails, logs are automatically located, parsed, and filtered to the exact components involved in the failure and made available as a visualization. Developers can even debug the specific parts of the U-SQL job that failed on their own local workstation without wasting time and money resubmitting jobs to the cloud. Our service can detect and analyze common performance problems that big data developers encounter, such as imbalanced data partitioning, and offer suggestions to fix your programs using the intelligence we\u2019ve gathered in the analysis of over a billion jobs in Microsoft\u2019s data lake.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Developers do a lot of heavy lifting when optimizing big data systems and frequently overpay for unused cluster capacity. Developers must manually optimize their data transformations, requiring them to carefully investigate how their data is transformed step-by-step, often manually ordering steps to gain improvements. 
Understanding performance and scale bottlenecks is challenging and requires distributed computing and infrastructure experts. For example, to improve performance, developers must carefully account for the time &amp; cost of data movement across a cluster and rewrite their queries or repartition their data. Data Lake\u2019s execution environment actively analyzes your programs as they run and offers recommendations to improve performance and reduce cost. For example, if you requested 1000 AUs for your program and only 50 AUs were needed, the system would recommend that you only use 50 AUs resulting in a 20x cost savings.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/ff7c9ed4-ae4f-471c-9fec-75b728af2d46.webp\" alt=\"Plexure company\" class=\"wp-image-10072 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/ff7c9ed4-ae4f-471c-9fec-75b728af2d46.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>We ingest a massive amount of live data from mobile, web, IoT and retail transactions. Data Lake gives us the ability to easily and cost effectively store everything and analyse what we need to, when we need to. The simplicity of ramping up parallel processing on the U-SQL queries removes the technical complexities of fighting with the data and lets the teams focus on the business outcomes. 
We are now taking this a step further and exposing the powerful Data Lake tools directly to our clients in our software allowing them to more easily explore their data using these tools.\u201d<\/em><\/p>\n<cite>-David Inggs, CTO at Plexure<\/cite><\/blockquote>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Today, we are also announcing the availability of this big data productivity environment in Visual Studio Code, giving users the same productivity in a free cross-platform code editor that is available on Windows, Mac OS X, and Linux.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"azure-hdinsight-introduces-fully-managed-kafka-for-real-time-analytics-and-r-server-for-advanced-analytics\">Azure HDInsight introduces fully managed Kafka for real-time analytics and R Server for advanced analytics<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">At Strata + Hadoop World New York, we <a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/new-security-performance-and-isv-solutions-build-on-azure-hdinsight-s-leadership-to-make-hadoop-enterprise-ready-for-the-cloud\/\" target=\"_blank\" rel=\"noopener\">announced<\/a> new security, performance and ISV solutions that build on Azure HDInsight\u2019s leadership for enterprise-ready cloud Hadoop. Today, we are announcing the public preview of Kafka for HDInsight. This service lets you ingest massive amounts of real-time data and analyze that data through integration with Storm and Spark for HDInsight and with Azure IoT Hub to build end-to-end IoT, fraud detection, click-stream analysis, financial alerts, or social analytics solutions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We are also announcing the general availability of R Server for HDInsight. Running Microsoft R Server as a service on top of Apache Spark, developers can achieve unprecedented scale and performance with code that combines the familiarity of the open source R language and Spark. 
Multi-threaded math libraries and transparent parallelization in R Server enable it to handle up to 1000x more data at up to 50x faster speeds than open source R, helping you train more accurate models for better predictions than previously possible. New at GA, RStudio Server Community Edition is included out of the box, making it easy for data scientists to get started quickly.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/94f8dff5-e545-4b92-a565-473e8fdfdbaa.webp\" alt=\"Milliman icon\" class=\"wp-image-10074 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/94f8dff5-e545-4b92-a565-473e8fdfdbaa.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Milliman is among the world\u2019s largest providers of actuarial and related products and services, with offices in major cities around the globe. R Server for HDInsight, offers the ability for our clients to be able to forecast risk over much larger datasets than ever before, improving the accuracy of predictions, in a cost-efficient way. 
The familiarity of the R Programming language to our users, as well as the ability to spin up Hadoop and Spark clusters within minutes, running at unprecedented scale and performance, is what really gets me excited about R Server for HDInsight.\u201d<\/em><\/p>\n<cite>-Paul Maher, Chief Technology Officer of the Life Technology Solutions practice at Milliman<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"enterprise-grade-security-auditing-and-support\">Enterprise-grade Security, Auditing and Support<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Enterprise-grade big data solutions must meet uptime guarantees, stringent security, governance &amp; compliance requirements, and integrate with your existing IT investments. Data Lake services (Store, Analytics, and HDInsight) are backed by an industry-leading 99.9% uptime SLA and 24\/7 support. They are built with the highest levels of security for authentication, authorization, auditing, and encryption to give you peace-of-mind when storing and analyzing sensitive corporate data and intellectual property. Data is always encrypted: in motion using SSL, and at rest using service or user managed HSM-backed keys in Azure Key Vault. Capabilities such as single sign-on (SSO), multi-factor authentication and seamless management of your on-premises identity &amp; access management are built in through Azure Active Directory. You can authorize users and groups with fine-grained POSIX-based ACLs for all data in the Store or with Apache Ranger in HDInsight enabling role-based access controls. 
Every access or configuration change is automatically audited for security and regulatory compliance requirements.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"supporting-open-source-and-open-standards\">Supporting open source and open standards<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Microsoft continues to collaborate with the open source community, as reflected by our contributions to Apache <a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/behind-the-scenes-of-azure-data-lake-bringing-microsoft-s-big-data-experience-to-hadoop\/\" target=\"_blank\" rel=\"noopener\">Hadoop<\/a>, <a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/apache-spark-for-azure-hdinsight-now-generally-available\/\" target=\"_blank\" rel=\"noopener\">Spark<\/a>, Apache <a href=\"https:\/\/reef.apache.org\/\" target=\"_blank\" rel=\"noopener\">REEF<\/a> and our work with Jupyter notebooks. This is also the case with Azure Data Lake.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Azure Data Lake Analytics uses Apache YARN, the central part of Apache Hadoop, to govern resource management and deliver consistent operations. 
Microsoft has been a primary contributor to YARN, with improvements to its performance, scale, and security.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/fd6df004-1318-4427-8419-a8a171a82331.webp\" alt=\"HortonWorks logo\" class=\"wp-image-10076 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/fd6df004-1318-4427-8419-a8a171a82331.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Hortonworks and Microsoft have partnered closely for the past 5 years to further the Hadoop platform for big data analytics, including contributions to YARN, Hive, and other Apache projects.\u00a0 Azure Data Lake services, including Azure HDInsight and Azure Data Lake Store, demonstrate our shared commitment to make it easier for everyone to work with big data in an open and collaborative way.\u201d<\/em><\/p>\n<cite>-Shaun Connolly, Chief Strategy Officer at Hortonworks<\/cite><\/blockquote>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Data Lake Store supports the open Apache Hadoop Distributed File System (HDFS) standard. 
Microsoft has also contributed improvements to HDFS such as OAuth 2.0 protocol support.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-nowrap is-layout-flex wp-container-core-group-is-layout-6fe931d8 wp-block-group-is-layout-flex\"><figure class=\"wp-block-image size-full has-custom-border\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/aa22628a-9c5a-4539-a463-c8eab2fb8ca9.webp\" alt=\"Cloudera icon\" class=\"wp-image-10078 webp-format\" style=\"border-radius:0px\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/aa22628a-9c5a-4539-a463-c8eab2fb8ca9.webp\"><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-medium-font-size wp-block-paragraph\"><em>Cloudera is working closely with Microsoft to integrate Cloudera Enterprise with the Azure Data Lake Store. Cloudera on Azure will benefit from the Data Lake Store which acts as a cloud-based landing zone for all data in your enterprise data hub.\u00a0 Cloudera will leverage Data Lake and provide customers with a secure and flexible big data solution in the future.\u201d<\/em><\/p>\n<cite>-Mike Olson, founder and chief strategy officer at Cloudera<\/cite><\/blockquote>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"leadership\">Leadership<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Both industry analysts and customers recognize Microsoft\u2019s capabilities in big data. Forrester recently <a href=\"https:\/\/www.forrester.com\/report\/The+Forrester+Wave+Big+Data+Hadoop+Cloud+Solutions+Q2+2016\/-\/E-RES126541#figure4\" target=\"_blank\" rel=\"noopener\">recognized<\/a> Microsoft Azure as a leader in their Big Data Hadoop Cloud Solutions. Forrester notes that leaders have the most comprehensive, scalable, and integrated platforms. 
Microsoft specifically was called out for having a cloud-first strategy that is paying off.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"getting-started-with-data-lake\">Getting started with Data Lake<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/data-lake-analytics\/\" target=\"_blank\" rel=\"noopener\">Data Lake Analytics<\/a> and <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/data-lake-store\/\">Store<\/a> are generally available today. <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/hdinsight\/r-server\/\" target=\"_blank\" rel=\"noopener\">R Server for HDInsight<\/a> is also generally available today. <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/hdinsight\/apache-kafka\/\" target=\"_blank\" rel=\"noopener\">Kafka for HDInsight<\/a> is in public preview. Try them today individually or as part of <a href=\"https:\/\/www.microsoft.com\/en-us\/cloud-platform\/cortana-intelligence-suite\" target=\"_blank\" rel=\"noopener\">Cortana Intelligence Suite<\/a> to transform your data into intelligent action.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Read the <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/data-lake-analytics\/\" target=\"_blank\" rel=\"noopener\">overview<\/a>, <a href=\"https:\/\/azure.microsoft.com\/en-us\/pricing\/details\/data-lake-analytics\/\" target=\"_blank\" rel=\"noopener\">pricing<\/a> and <a href=\"https:\/\/azure.microsoft.com\/en-us\/documentation\/services\/data-lake-analytics\/\" target=\"_blank\" rel=\"noopener\">getting started<\/a> pages of Data Lake Analytics or attend the <a href=\"https:\/\/mva.microsoft.com\/en-US\/training-courses\/introducing-azure-data-lake-16910?l=hydSToaDD_9306218965\" target=\"_blank\" rel=\"noopener\">free course<\/a><\/li>\n\n\n\n<li class=\"wp-block-list-item\">Read the <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/data-lake-store\/\" target=\"_blank\" 
rel=\"noopener\">overview<\/a>, <a href=\"https:\/\/azure.microsoft.com\/en-us\/pricing\/details\/data-lake-store\/\" target=\"_blank\" rel=\"noopener\">pricing<\/a> and <a href=\"https:\/\/azure.microsoft.com\/en-us\/documentation\/services\/data-lake-store\/\" target=\"_blank\" rel=\"noopener\">getting started<\/a> pages of Data Lake Store<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Read the <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/hdinsight\/r-server\/\" target=\"_blank\" rel=\"noopener\">R Server<\/a>, <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/hdinsight\/apache-kafka\/\" target=\"_blank\" rel=\"noopener\">Kafka<\/a> overview and <a href=\"https:\/\/azure.microsoft.com\/en-us\/pricing\/details\/hdinsight\/\" target=\"_blank\" rel=\"noopener\">pricing<\/a> pages of HDInsight<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">@josephsirosh<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and 
scientists.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1474,1491],"tags":[48],"audience":[3054,3057,3053],"content-type":[1465],"product":[1610],"tech-community":[],"topic":[],"coauthors":[97],"class_list":["post-4326","post","type-post","status-publish","format-standard","hentry","category-analytics","category-storage","tag-big-data","audience-business-decision-makers","audience-data-professionals","audience-it-decision-makers","content-type-announcements","product-azure-data-lake-storage","review-flag-1680286581-295","review-flag-1680286581-56","review-flag-1680286581-364","review-flag-2-1680286581-601","review-flag-24-7-1680286585-656","review-flag-5-1680286581-950","review-flag-7-1680286581-146","review-flag-9-1680286581-259","review-flag-alway-1680286580-106","review-flag-forre-1680286585-445","review-flag-free-1680286579-836","review-flag-ga-1680286584-289","review-flag-gener-1680286584-335","review-flag-integ-1680286579-214","review-flag-iot-1680286585-835","review-flag-lever-1680286579-649","review-flag-machi-1680286585-314","review-flag-microsofts","review-flag-new-1680286579-546","review-flag-publi-1680286584-566"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>The Intelligent Data Lake | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" 
href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Intelligent Data Lake | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2016-11-16T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-17T16:50:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\" \/>\n<meta name=\"author\" content=\"Microsoft Azure\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Microsoft Azure\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"11 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/\",\"@type\":\"Person\",\"@name\":\"Microsoft Azure\"}],\"headline\":\"The Intelligent Data Lake\",\"datePublished\":\"2016-11-16T00:00:00+00:00\",\"dateModified\":\"2025-06-17T16:50:10+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\"},\"wordCount\":2829,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\",\"keywords\":[\"Big Data\"],\"articleSection\":[\"Analytics\",\"Storage\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\",\"name\":\"The Intelligent Data Lake | Microsoft Azure 
Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\",\"datePublished\":\"2016-11-16T00:00:00+00:00\",\"dateModified\":\"2025-06-17T16:50:10+00:00\",\"description\":\"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Analytics\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/analytics\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"The Intelligent Data 
Lake\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure 
Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117\",\"name\":\"shakir\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"caption\":\"shakir\"},\"sameAs\":[\"https:\/\/azure.microsoft.com\"],\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"The Intelligent Data Lake | Microsoft Azure Blog","description":"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/","og_locale":"en_US","og_type":"article","og_title":"The Intelligent Data Lake | Microsoft Azure Blog","og_description":"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2016-11-16T00:00:00+00:00","article_modified_time":"2025-06-17T16:50:10+00:00","og_image":[{"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp","type":"","width":"","height":""}],"author":"Microsoft Azure","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Microsoft Azure","Est. 
reading time":"11 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/","@type":"Person","@name":"Microsoft Azure"}],"headline":"The Intelligent Data Lake","datePublished":"2016-11-16T00:00:00+00:00","dateModified":"2025-06-17T16:50:10+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/"},"wordCount":2829,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp","keywords":["Big Data"],"articleSection":["Analytics","Storage"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/","name":"The Intelligent Data Lake | Microsoft Azure 
Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp","datePublished":"2016-11-16T00:00:00+00:00","dateModified":"2025-06-17T16:50:10+00:00","description":"The general availability of Azure Data Lake ushers in a new era of productivity for your big data developers and scientists.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2016\/11\/571f2d03-6e94-43ad-b02c-5ec6b337a3d0.webp"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/the-intelligent-data-lake\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"Analytics","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/analytics\/"},{"@type":"ListItem","position":3,"name":"The Intelligent Data Lake"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the 
Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure 
Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117","name":"shakir","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4","url":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","caption":"shakir"},"sameAs":["https:\/\/azure.microsoft.com"],"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure 
Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/4326","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=4326"}],"version-history":[{"count":2,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/4326\/revisions"}],"predecessor-version":[{"id":42118,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/4326\/revisions\/42118"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=4326"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=4326"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=4326"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=4326"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=4326"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=4326"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=4326"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=4326"},{"taxonomy":"author","embeddable":tr
ue,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=4326"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}