{"id":1883,"date":"2018-12-10T00:00:00","date_gmt":"2018-12-10T00:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update"},"modified":"2023-05-11T15:35:29","modified_gmt":"2023-05-11T22:35:29","slug":"azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/","title":{"rendered":"Azure HDInsight integration with Data Lake Storage Gen2 preview &#8211; ACL and security update"},"content":{"rendered":"<p>Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.<\/p>\n<h2>Azure Data Lake Storage Gen2<\/h2>\n<p><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/storage\/data-lake-storage\/\" target=\"_blank\" rel=\"noopener\">Azure Data Lake Storage Gen2<\/a> is the only data lake designed specifically for enterprises to run large scale analytics workloads in the cloud. It unifies the core capabilities from the first generation of Azure Data Lake with a Hadoop compatible file system endpoint now directly integrated into Azure Blob Storage. This enhancement combines the scale and cost benefits of object storage with the reliability and performance typically associated only with on-premises file systems. This new file system includes a full hierarchical namespace that makes files and folders first class citizens, translating to faster, more reliable analytics job execution.<\/p>\n<p>Azure Data Lake Storage Gen2 also includes limitless storage ensuring capacity to meet the needs of even the largest, most complex workloads. In addition, Azure Data Lake Storage Gen2 delivers on native integration with Azure Active Directory and support POSIX compliant ACLs to enable granular permission assignments on files and folders.<\/p>\n<h2>Key benefits<\/h2>\n<h3>Hadoop compatible access<\/h3>\n<p>Azure Data Lake Storage Gen2 allows you to manage and access data just as you would with a Hadoop Distributed File System (HDFS). The ABFS driver is available within all Apache Hadoop environments. File systems are well understood by developers and users alike. There is no need to learn a new storage paradigm when you move to the cloud as the file system interface exposed by Azure Data Lake Storage Gen2 is the same paradigm used by computers, large and small.<\/p>\n<h3>Role based access control<\/h3>\n<p>The security model for Azure Data Lake Storage Gen2 supports ACL and POSIX permissions.<\/p>\n<p>These storage ACL capabilities along with fine grain access control via apache Ranger in HDInsight for applications such as Spark, Kafka, Hive, and HBase make it very convenient to open up your data lake for entire organization with appropriate security control and auditing in place.<\/p>\n<h3>SSL only access<\/h3>\n<p>With this update, ADLS Gen 2 accounts can only be accessed via https protocol ensuring that only encrypted communication is possible between HDInsight and storage.<\/p>\n<h3>Global availability<\/h3>\n<p>Azure Data Lake Storage Gen 2 and HDInsight are available across the globe, offering the scale needed to bring big data applications closer to users around the world, preserving data residency, and offering comprehensive compliance and resiliency options for customers.<\/p>\n<h3>Atomic directory manipulation<\/h3>\n<p>Object stores approximate a directory hierarchy by adopting a convention of embedding slashes (\/) in the object name to denote path segments. While this convention works for organizing objects, the convention provides no assistance for actions like moving, renaming, or deleting directories. Without real directories, applications must process potentially millions of individual blobs to achieve directory-level tasks. By contrast, the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/storage\/data-lake-storage\/namespace\" target=\"_blank\" rel=\"noopener\">hierarchical namespace<\/a> processes these tasks by updating a single entry (the parent directory).<\/p>\n<p>This dramatic optimization is especially significant for many big data analytics frameworks. Tools like Hive and Spark often write output to temporary locations and then rename the location at the conclusion of the job. Without the hierarchical namespace, this rename can often take longer than the analytics process itself. Lower job latency equals lower total cost of ownership (TCO) for analytics workloads.<\/p>\n<h3>Scale<\/h3>\n<p>HDInsight and Azure Data Lake Storage Gen2 bring new levels of scale for big data workloads. Customers can run workloads that scale at 100\u2019s Gb\/Sec to Petabytes of storage without needing to shard the data across multiple storage accounts.<\/p>\n<h3>Encryption at REST<\/h3>\n<p>Encryption in Azure Data Lake Storage Gen2 helps you protect your data, implement enterprise security policies, and meet regulatory compliance requirements. Azure Data Lake Storage Gen 2 supports encryption of data both at rest and in transit.<\/p>\n<h3>Network firewall<\/h3>\n<p>Integrated <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/storage\/common\/storage-network-security\" target=\"_blank\" rel=\"noopener\">network firewall capabilities<\/a> allow you to define rules restricting access only to requests originating from specified networks or HDInsight clusters in a specific VNET.<\/p>\n<h2>How does the integration work?<\/h2>\n<p>HDInsight and Azure Data Lake Storage Gen2 integration is based upon <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/active-directory\/managed-identities-azure-resources\/overview\" target=\"_blank\" rel=\"noopener\">user-assigned managed identity<\/a>. You assign appropriate access to HDInsight with your Azure Data Lake Storage Gen2 accounts. Once configured, your HDInsight cluster is able to use Azure Data Lake Storage Gen2 as its storage.<\/p>\n<p>1. Create an Azure storage account and enable Data Lake Storage Gen 2 preview.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"Storage configuration in Data Lake Storage Gen 2\" height=\"309\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\" title=\"Storage configuration in Data Lake Storage Gen 2\" width=\"584\"><\/p>\n<p>2. Create a user assigned managed identity.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"Creating a user managed identity \" height=\"313\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/abadaf17-101d-4a56-8f66-479332fd89ff.webp\" title=\"Creating a user managed identity \" width=\"306\"><\/p>\n<p>3. Assign Storage Blob Data Owner access to the created managed identity on Azure Storage.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"Assigning Storage Blob Data Owner access \" height=\"601\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/1e532398-e33f-4862-b582-b6d8adaf1ded.webp\" title=\"Assigning Storage Blob Data Owner access \" width=\"2308\"><\/p>\n<p>4. Now you can proceed to creating HDInsight cluster. In the storage blade, select the storage account, and the associated managed user identity, and proceed with cluster creation workflow.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"Selecting the storage account and associated managed user identity\" height=\"491\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/00d1be40-acab-49c1-9ffd-caed55998bb5.webp\" title=\"Selecting the storage account and associated managed user identity\" width=\"494\"><\/p>\n<h2>Getting started<\/h2>\n<p>Start using <a href=\"https:\/\/docs.microsoft.com\/azure\/hdinsight\/hdinsight-hadoop-data-lake-storage-gen2\" target=\"_blank\" rel=\"noopener\">Azure Data Lake Storage Gen2 with Azure HDInsight today<\/a>.<\/p>\n<h2>Feedback<\/h2>\n<p>We look forward to your comments and feedback. If there are any feature requests, customer asks, or suggestions, please contact us at <a target=\"_blank\" rel=\"noopener\">askhdinsight@microsoft.com<\/a>.<\/p>\n<h2>Additional resources<\/h2>\n<ul>\n<li><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/storage\/blobs\/data-lake-storage-introduction?toc=%2fazure%2fstorage%2fblobs%2ftoc.json\" target=\"_blank\" rel=\"noopener\">Azure Data Lake Storage Gen2 introduction<\/a><\/li>\n<li><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/storage\/blobs\/data-lake-storage-namespace?toc=%2fazure%2fstorage%2fblobs%2ftoc.json\" target=\"_blank\" rel=\"noopener\">Hierarchical Namespace concept<\/a><\/li>\n<li><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/storage\/blobs\/data-lake-storage-quickstart-create-connect-hdi-cluster?toc=%2fazure%2fstorage%2fblobs%2ftoc.json\" target=\"_blank\" rel=\"noopener\">Create HDInsight cluster with ADLS Gen2<\/a><\/li>\n<li>Learn more about <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/hdinsight\/\" target=\"_blank\" rel=\"noopener\">Azure HDInsight<\/a>.<\/li>\n<li>Read the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/hdinsight\/hdinsight-component-versioning#hadoop-components-available-with-different-hdinsight-versions\" target=\"_blank\" rel=\"noopener\">Open Source component guide on HDInsight<\/a>.<\/li>\n<li>Review the <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/hdinsight\/hdinsight-release-notes\" target=\"_blank\" rel=\"noopener\">HDInsight release notes<\/a>.<\/li>\n<li>Ask HDInsight questions on <a href=\"https:\/\/social.msdn.microsoft.com\/forums\/azure\/en-us\/home?forum=hdinsight\" target=\"_blank\" rel=\"noopener\">MSDN forums<\/a>.<\/li>\n<li>Ask HDInsight questions on <a href=\"https:\/\/stackoverflow.com\/questions\/tagged\/hdinsight\" target=\"_blank\" rel=\"noopener\">StackOverflow<\/a>.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1474,1491],"tags":[],"audience":[3054,3057,3053,3056],"content-type":[],"product":[1610,2895],"tech-community":[],"topic":[],"coauthors":[97],"class_list":["post-1883","post","type-post","status-publish","format-standard","hentry","category-analytics","category-storage","audience-business-decision-makers","audience-data-professionals","audience-it-decision-makers","audience-it-implementors","product-azure-data-lake-storage","product-azure-hdinsight-on-azure-kubernetes-service-aks"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2018-12-10T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-05-11T22:35:29+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\" \/>\n<meta name=\"author\" content=\"Microsoft Azure\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Microsoft Azure\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/\",\"@type\":\"Person\",\"@name\":\"Microsoft Azure\"}],\"headline\":\"Azure HDInsight integration with Data Lake Storage Gen2 preview &#8211; ACL and security update\",\"datePublished\":\"2018-12-10T00:00:00+00:00\",\"dateModified\":\"2023-05-11T22:35:29+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\"},\"wordCount\":879,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\",\"articleSection\":[\"Analytics\",\"Storage\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\",\"name\":\"Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\",\"datePublished\":\"2018-12-10T00:00:00+00:00\",\"dateModified\":\"2023-05-11T22:35:29+00:00\",\"description\":\"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Analytics\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/analytics\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Azure HDInsight integration with Data Lake Storage Gen2 preview &#8211; ACL and security update\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117\",\"name\":\"shakir\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"caption\":\"shakir\"},\"sameAs\":[\"https:\/\/azure.microsoft.com\"],\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog","description":"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/","og_locale":"en_US","og_type":"article","og_title":"Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog","og_description":"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2018-12-10T00:00:00+00:00","article_modified_time":"2023-05-11T22:35:29+00:00","og_image":[{"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp","type":"","width":"","height":""}],"author":"Microsoft Azure","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Microsoft Azure","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/microsoft-azure\/","@type":"Person","@name":"Microsoft Azure"}],"headline":"Azure HDInsight integration with Data Lake Storage Gen2 preview &#8211; ACL and security update","datePublished":"2018-12-10T00:00:00+00:00","dateModified":"2023-05-11T22:35:29+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/"},"wordCount":879,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp","articleSection":["Analytics","Storage"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/","name":"Azure HDInsight integration with Data Lake Storage Gen2 preview - ACL and security update | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp","datePublished":"2018-12-10T00:00:00+00:00","dateModified":"2023-05-11T22:35:29+00:00","description":"Today we are sharing an update to the Azure HDInsight integration with Azure Data Lake Storage Gen 2. This integration will enable HDInsight customers to drive analytics from the data stored in Azure Data Lake Storage Gen 2 using popular open source frameworks such as Apache Spark, Hive, MapReduce, Kafka, Storm, and HBase in a secure manner.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2018\/12\/80d198fd-aedf-4640-8cde-3e62796e8473.webp"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/azure-hdinsight-integration-with-data-lake-storage-gen-2-preview-acl-and-security-update\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"Analytics","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/analytics\/"},{"@type":"ListItem","position":3,"name":"Azure HDInsight integration with Data Lake Storage Gen2 preview &#8211; ACL and security update"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117","name":"shakir","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4","url":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","caption":"shakir"},"sameAs":["https:\/\/azure.microsoft.com"],"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/1883","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=1883"}],"version-history":[{"count":0,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/1883\/revisions"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=1883"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=1883"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=1883"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=1883"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=1883"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=1883"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=1883"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=1883"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=1883"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}