Skip Navigation

Azure HDInsight pricing

Azure HDInsight is a fully-managed cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. Use the most popular open-source frameworks such as Hadoop, Spark, Hive, LLAP, Kafka, Storm, HBase, Microsoft ML Server & more. Azure HDInsight enables a broad range of scenarios such as ETL, Data Warehousing, Machine Learning, IoT and more.

Service features

Preconfigured clusters optimized for different big data scenarios

99.9 % SLA on the cluster

High Availability

Cost-effective for cloud scale

On Demand job execution via Azure Data Factory

Network Security: Secure Gateway Azure VNET Support

Data Security: Encryption +Role-based access control on Storage

Advanced Monitoring with Azure Log Analytics

Integration: Azure Cosmos DB and other Azure data services

Active Directory and Apache Ranger integration with Enterprise Security Package

Components

Hadoop

Spark

Interactive Query

HDInsight Machine Learning Services*

Kafka

HBase

Storm

Extend HDInsight to install any Open Source Engine**

*HDInsight Machine Learning Services incurs additional surcharge

**No support of SLA provided by Microsoft for these open source apps. Support and SLA only provided for the above workloads.

Pricing features

Azure HDInsight Clusters

Billed on a per minute basis, clusters run a group of nodes depending on the component. Nodes vary by group (e.g. Worker Node, Head Node, etc.), quantity, and instance type (e.g. D1v2).

Refer to the FAQ below for details on workloads and the required nodes. Customers will be billed for each node for the duration of the cluster's life. The node price below represents for all workloads except Microsoft ML Server, which incurs an additional surcharge.

Enterprise Security Package

Enterprise Security Package is now available in GA . There is an add-on surcharge for Enterprise Security Package and will be reduced by 50% to $-/core-hour starting October 1, 2018.

Pricing details


Component Pricing
Hadoop, Spark, Interactive Query, Kafka*, Storm, HBase Base price/node-hour
HDInsight Machine Learning Services** Base price/node-hour + $-/core-hour
Enterprise Security Package Base price/node-hour + $-/core-hour
*Kafka requires managed disks. Choose between Standard Managed Disks or Premium Managed Disks. Managed disk prices apply.
**HDInsight Machine Learning Services incurs additional surcharge

Base price/node-hour

Memory Optimized nodes for HDInsight

Instance CPU RAM OS HDInsight Price Total Price++
D11 v2 2 14 GB $- $- $-
D12 v2 4 28 GB $- $- $-
D13 v2 8 56 GB $- $- $-
D14 v2 16 112 GB $- $- $-
++Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price++
G1 2 28 GB $- $- $-
G2 4 56 GB $- $- $-
G3 8 112 GB $- $- $-
G4 16 224 GB $- $- $-
G5 32 448 GB $- $- $-
++Customers will continue to see one line item on their bill for Total Price

General Purpose nodes for HDInsight

AV2 HDInsight nodes run on Av2 Standard VM, which is the latest generation of A-series virtual machines with similar CPU performance and faster disk.

Instance CPU RAM OS HDInsight Price Total Price++
A1 v2 1 2 GB $- $- $-
A2m v2 2 16 GB $- $- $-
A2 v2 2 4 GB $- $- $-
A4m v2 4 32 GB $- $- $-
A4 v2 4 8 GB $- $- $-
A8m v2 8 64 GB $- $- $-
A8 v2 8 16 GB $- $- $-
++Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price++
A1 1 1.75 GB $- $- $-
A2 2 3.5 GB $- $- $-
A3 4 7 GB $- $- $-
A4 8 14 GB $- $- $-
A5 2 14 GB $- $- $-
A6 4 28 GB $- $- $-
A7 8 56 GB $- $- $-
++Customers will continue to see one line item on their bill for Total Price

Instance CPU RAM OS HDInsight Price Total Price++
D1 v2 1 3.5 GB $- $- $-
D2 v2 2 7 GB $- $- $-
D3 v2 4 14 GB $- $- $-
D4 v2 8 28 GB $- $- $-
D5 v2 16 56 GB $- $- $-
++Customers will continue to see one line item on their bill for Total Price

Dev/Test pricing available

Discounted Azure pricing is available for Visual Studio subscribers looking to run development and testing workloads, individually or as a team. Active Visual Studio subscribers can take advantage of a wide range of discounts when using an Azure subscription based on a dev/test offer. Learn more about dev/test offers and Visual Studio subscriptions.

Support & SLA

  • Free billing and subscription management support.
  • Flexible support plans starting at $29/month. Find a plan.
  • Guaranteed 99.9% connectivity for multiple instances. Read the SLA.

FAQ

  • HDInsight deploys different number of nodes for each cluster type. Within a given cluster type, there are different roles for the various nodes, which allow a customer to size those nodes in a given role appropriate to the details of their workload. For example, a Hadoop cluster can have its worker nodes provisioned with a large amount of memory if the type of analytics being performed are memory intensive.

    Hadoop clusters for HDInsight are deployed with two roles:

    • Head node (2 nodes)
    • Data node (at least 1 node)

    HBase clusters for HDInsight are deployed with three roles:

    • Head servers (2 nodes)
    • Region servers (at least 1 node)
    • Master/Zookeeper nodes (3 nodes)

    Storm clusters for HDInsight are deployed with three roles:

    • Nimbus nodes (2 nodes)
    • Supervisor servers (at least 1 node)
    • Zookeeper nodes (3 nodes)

    Spark clusters for HDInsight are deployed with three roles:

    • Head node (2 nodes)
    • Worker node (at least 1 node)
    • Zookeeper nodes (3 nodes) (Free for A1 zookeepers)

    The use of R-Server will incur one edge node in addition to the cluster deployment architecture.

  • We charge for the number of minutes your cluster is running, rounded to the nearest minute, not hour.

  • If you run a cluster for 100 hours in US East with two D13 v2 head nodes, three D12 v2 data nodes, and three D11 v2 zookeepers, the billing would be the following in the two scenarios:

    • On a Standard HDInsight cluster—100 hours x (2 x $-/hour + 3 x $-/hour + 3 x $-/hour) = $-
    • On a Standard HDInsight cluster with Enterprise Security Package—100 hours x (2 x $-/hour + 3 x $-/hour + 3 x $-/hour) + 100 hours x (2 x 8 + 3 x 4 + 3 x 2) x $-/core-hour = $-
  • In order to stop an HDInsight cluster, you must delete the cluster. By default, all data an HDInsight cluster operates on resides in Azure Blob storage, so data will not be impacted by this. If you want to preserve your Hive metadata (tables, schemas) you should provision a cluster with an external metadata store. You can find more details in this documentation.

  • The number of data nodes will vary depending on your needs. With the elasticity available in Azure cloud services, you can try a variety of cluster sizes to determine your own optimal mix of performance and cost, and only pay for what you use at any given time. Clusters can also be scaled on demand to grow and shrink to match the requirements of your workload.

  • Each subscription has a default limit on how many HDInsight data nodes can be created. If you need to create a larger HDInsight cluster or multiple HDInsight clusters that together exceed your current subscription maximum, you can request that your subscription's billing limits be increased. Please open a support ticket with Support Type = Billing. Depending on the maximum nodes per subscription that you request, you may be asked for additional information that will allow us to optimize your deployment(s).

  • To estimate the cost of clusters of various sizes, try the Azure Calculator.

Resources

Estimate your monthly costs for Azure services

Review Azure pricing frequently asked questions

Learn more about HDInsight

Review technical tutorials, videos, and more resources

Added to estimate. Press 'v' to view on calculator View on calculator

Learn and build with $200 in credit, and keep going for free