Data Lake Storage Gen1

Explore the applications our enterprise customers and partners are using to build big data solutions with Azure Data Lake.

Azure Data Lake Storage Gen1 (ADLS Gen1) supports multiple analytics workload engines. Integrate commercial distributions from Cloudera, Hortonworks, and Qubole that support ADLS, or build your own clusters with the Apache open-source client.

Cloudera

Cloudera Enterprise combines Hadoop with other open-source projects to create a single massively scalable system, the Enterprise Data Hub, that unites storage with an array of powerful processing and analytic frameworks. Run Spark, Hive, HBase, Impala, and MapReduce workloads in a Cloudera cluster on Azure Data Lake Store.

Hortonworks

Hortonworks Data Platform (HDP) integrates with Azure Data Lake Store so that enterprises can work with large volumes of structured and unstructured data. Try Cloudbreak for HDP to quickly provision an HDP cluster and start working with ADLS data.

Qubole

Qubole Data Service (QDS) is a comprehensive big data platform that self-manages, self-optimizes, and learns from your usage. Qubole provides a unified platform for ETL, streaming, data science, and ad hoc analytics. In addition to Hadoop and Spark, Qubole now offers interactive querying capability on ADLS with Presto.

Apache Hadoop

Azure Data Lake has an open-source client for Apache Hadoop. Use it to build custom Hadoop and Spark clusters.
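
As a rough illustration of wiring a custom cluster to ADLS Gen1, the sketch below generates the kind of `core-site.xml` properties the Apache Hadoop `hadoop-azure-datalake` module reads for service-principal (client credential) authentication, and builds an `adl://` URI. The helper names (`adl_uri`, `adl_oauth_properties`, `render_core_site`), the account name, and the placeholder credentials are hypothetical and not part of this page. Verify the property names and token endpoint against the Hadoop documentation for your version before relying on them.

```python
import xml.etree.ElementTree as ET


def adl_uri(account: str, path: str = "/") -> str:
    """Build an adl:// URI for a (hypothetical) ADLS Gen1 account name."""
    return f"adl://{account}.azuredatalakestore.net/{path.lstrip('/')}"


def adl_oauth_properties(tenant_id: str, client_id: str, client_secret: str) -> dict:
    """Hadoop properties for client-credential (service principal) auth.

    Property names follow the hadoop-azure-datalake module; the values
    passed in here are placeholders, not real credentials.
    """
    return {
        "fs.adl.oauth2.access.token.provider.type": "ClientCredential",
        "fs.adl.oauth2.refresh.url": (
            f"https://login.microsoftonline.com/{tenant_id}/oauth2/token"
        ),
        "fs.adl.oauth2.client.id": client_id,
        "fs.adl.oauth2.credential": client_secret,
    }


def render_core_site(props: dict) -> str:
    """Render the properties as a Hadoop <configuration> XML string."""
    root = ET.Element("configuration")
    for name, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    props = adl_oauth_properties("TENANT_ID", "CLIENT_ID", "CLIENT_SECRET")
    print(render_core_site(props))
    print(adl_uri("exampleaccount", "/data/events"))
```

With properties like these in place, Hadoop and Spark jobs on the cluster would typically address data through `adl://` paths. In practice, keep the client secret in a credential provider rather than in plain-text configuration.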

Azure Data Lake customers also gain insights on their business data using a wide range of big data ecosystem applications.

AtScale

Query data in place with OLAP and your business intelligence tool of choice, without additional data movement, whether the data lands in your data lake or your HDInsight cluster.

Imanis Data

Imanis Data (formerly Talena, Inc.) delivers an enterprise-grade backup and recovery software solution for modern data environments. Designed for rapid recovery from developer errors or application corruption in your production data environment, Imanis Data supports both Azure Blob Storage and ADLS.

Paxata

Paxata's Adaptive Information Platform enables any user to rapidly gain insights from their data on ADLS and other sources. Profile structured or unstructured data for completeness and quality, enrich it with additional context, and prepare it for further consumption. Business users can do all of the above through an intuitive visual experience in a fully governed environment, with added security and compliance provided by Azure Data Lake and Azure HDInsight.

StreamSets

Efficiently develop batch and streaming dataflows, operate them with full visibility and control, and easily evolve your architecture over time with StreamSets Data Collector.