Easy, cost-effective, enterprise-grade service for open source analytics

What are the benefits of HDInsight?

Easily run popular open source frameworks – including Apache Hadoop, Spark and Kafka – using Azure HDInsight, a cost-effective, enterprise-grade service for open source analytics. Effortlessly process massive amounts of data and get all the benefits of the broad open source ecosystem with the global scale of Azure.

  • Use HDInsight tools to easily get started in your favourite development environment.
  • HDInsight integrates seamlessly with other Azure services, including Data Factory and Data Lake Storage, for building comprehensive analytics pipelines.

Why HDInsight?


Quickly spin up open source projects and clusters, with no hardware to install or infrastructure to manage.


Reduce costs by creating big data clusters on demand, easily scaling them up or down, and paying only for what you use.


Get enterprise-grade security and industry-leading compliance with more than 30 certifications.


Create optimised components for Hadoop, Spark and more. Keep up to date with the latest versions.

What comes with HDInsight?

Open source ecosystem

HDInsight supports the latest open source projects from the Apache Hadoop and Spark ecosystems. Stay up to date with the newest releases of open source frameworks, including Kafka, HBase and Hive LLAP.

Security and compliance

Get enterprise-grade data protection with monitoring, virtual networks, encryption, Active Directory authentication, authorisation and role-based access control. HDInsight has more than 30 industry certifications, including ISO, SOC, HIPAA and PCI, to meet compliance standards.

Native integration with Azure services

Seamlessly integrate with a wide variety of Azure data stores and services, including Synapse Analytics, Azure Cosmos DB, Data Lake Storage, Blob Storage, Event Hubs and Data Factory.

Simplified monitoring

HDInsight integrates with Azure Log Analytics to provide a single interface where you can monitor all your clusters.

Broad app support

HDInsight supports a broad range of apps from the big data ecosystem, which you can install with a single click. Pick from more than 30 popular Hadoop and Spark apps for a variety of scenarios.

Multiple languages and tools

Use your preferred productivity tools, including Visual Studio, Eclipse, IntelliJ, Jupyter and Zeppelin. Write code in familiar languages such as Scala, Python, R, JavaScript and .NET.

Azure HDInsight ecosystem


Apache Zeppelin

VS Code



Data access





Machine Learning


Azure Data Lake Storage


Apache Ranger

Azure Active Directory

Virtual Network

Customers using HDInsight

Reckitt Benckiser

What can you do with HDInsight?

Extract, transform and load (ETL) using HDInsight

Extract, transform and load your big data clusters on demand with Hadoop MapReduce and Apache Spark.

HDInsight를 사용한 ETL(추출, 변환 및 로드)요청이 있을 시 Hadoop MapReduce 및 Apache Spark를 사용하여 빅 데이터 클러스터를 추출, 변환, 로드하세요.

Streaming using HDInsight

Ingest and process millions of streaming events per second with Apache Kafka, Apache Storm and Apache Spark Streaming.

HDInsight를 사용한 스트리밍Apache Kafka, Apache Storm 및 Apache Spark Streaming을 사용하여 초당 수백만 개의 스트리밍 이벤트를 수집하고 처리하세요.

Interactive querying with HDInsight

Perform fast, interactive SQL queries at scale over structured or unstructured data with Apache Hive LLAP.

HDInsight의 대화형 쿼리Apache Hive LLAP를 사용하여 구조적 또는 비구조적 데이터에 대한 신속한 대화형 SQL 쿼리를 대규모로 수행하세요.

Extend your on-premises big data investments with HDInsight

Extend your on-premises big data investments to the cloud and transform your business using the advanced analytics capabilities of HDInsight.

HDInsight로 온-프레미스 빅 데이터 투자 확장온-프레미스 빅 데이터 투자를 클라우드로 확장하고, HDInsight의 고급 분석 기능을 사용하여 비즈니스를 혁신하세요.

Use Cases

Customer insights

Help employees make data-driven decisions by building an end-to-end open source analytics platform. Easily process massive amounts of data from different sources.

Learn how Reckitt Benckiser uses HDInsight for consumer insights.

Personalised recommendations

Engage your customers in new ways by building personalised recommendation engines.

Learn how ASOS uses HDInsight for personalised recommendations.

Predictive maintenance

Predict and prevent failures and keep vital equipment running. Ingest and process data in real time to optimise operations.

Learn how Roche Diagnostics uses HDInsight for predictive maintenance.

Risk assessment

Build better models by transforming and analysing your critical data, and help keep your data secure with enterprise-grade capabilities.

Learn how Milliman uses HDInsight for risk assessment.

Related products and services

Azure Databricks

Fast, easy and collaborative Apache Spark-based analytics platform

Azure Data Lake Storage

Massively scalable, secure data lake functionality built on Azure Blob Storage

Azure Synapse Analytics

Limitless analytics service with unmatched time to insight (formerly SQL Data Warehouse)

Start using HDInsight today