HDInsight pricing

Provision cloud Hadoop, Spark, R Server, HBase, and Storm clusters

Azure HDInsight is a fully-managed cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. Use the most popular open-source frameworks such as Hadoop, Spark, Hive, LLAP, Kafka, Storm, HBase, Microsoft ML Server & more. Azure HDInsight enables a broad range of scenarios such as ETL, Data Warehousing, Machine Learning, IoT and more.

Explore pricing options

Apply filters to customize pricing options to your needs.

Azure HDInsight Clusters

Billed on a per minute basis, clusters run a group of nodes depending on the component. Nodes vary by group (e.g. Worker Node, Head Node, etc.), quantity, and instance type (e.g. D1v2).

Refer to the FAQ below for details on workloads and the required nodes. Customers will be billed for each node for the duration of the cluster's life. The node price below represents for all workloads except Microsoft ML Server, which incurs an additional surcharge.


Component Pricing
Hadoop, Spark, Interactive Query, Kafka*, Storm, HBase Base price/node-hour + $0/core-hour
HDInsight Machine Learning Services** Base price/node-hour + $0.016/core-hour
Enterprise Security Package Base price/node-hour + $-/core-hour
*Kafka requires managed disks. Choose between Standard Managed Disks or Premium Managed Disks. Managed disk prices apply.
**HDInsight Machine Learning Services incurs additional surcharge

Base price/node-hour

Listed below is the full offering of all Azure VMs. Additionally, although it only shows Ev3 pricing, our Esv3 instances are offered at the same price. To see which VMs are supported by HDInsight, and their prices, please refer to the “Configuration & Pricing” section at the cluster creation stage on Azure Portal. Reference Microsoft Docs for additional information on HDInsight default and recommended VMs.

Memory Optimized nodes for HDInsight

Instance vCPU(s) RAM OS HDInsight Price Total Price*
E2 v3 2 16 GB $- $- $-
E4 v3 4 32 GB $- $- $-
E8 v3 8 64 GB $- $- $-
E16 v3 16 128 GB $- $- $-
E20 v3 20 160 GB $- $- $-
E32 v3 32 256 GB $- $- $-
E64i v3 64 432 GB $- $- $-
E64 v3 64 432 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price*
D12 v2 4 28 GB $- $- $-
D13 v2 8 56 GB $- $- $-
D14 v2 16 112 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price*
D2a v4 2 8 GB $- $- $-
D4a v4 4 16 GB $- $- $-
D8a v4 8 32 GB $- $- $-
D16a v4 16 64 GB $- $- $-
D32a v4 32 128 GB $- $- $-
D48a v4 48 192 GB $- $- $-
D64a v4 64 256 GB $- $- $-
D96a v4 96 384 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price

Compute Optimized nodes for HDInsight

Instance CPU RAM OS HDInsight Price Total Price*
F4 4 8 GB $- $- $-
F8 8 16 GB $- $- $-
F16 16 32 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price

General Purpose nodes for HDInsight

AV2 HDInsight nodes run on Av2 Standard VM, which is the latest generation of A-series virtual machines with similar CPU performance and faster disk.

Instance CPU RAM OS HDInsight Price Total Price*
A1 v2 1 2 GB $- $- $-
A2 v2 2 4 GB $- $- $-
A2m v2 2 16 GB $- $- $-
A4 v2 4 8 GB $- $- $-
A4m v2 4 32 GB $- $- $-
A8 v2 8 16 GB $- $- $-
A8m v2 8 64 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price*
A1 1 1.75 GB $- $- $-
A2 2 3.5 GB $- $- $-
A5 2 14 GB $- $- $-
A3 4 7 GB $- $- $-
A6 4 28 GB $- $- $-
A4 8 14 GB $- $- $-
A7 8 56 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price
Instance CPU RAM OS HDInsight Price Total Price*
D1 v2 1 3.5 GB $- $- $-
D2 v2 2 7 GB $- $- $-
D3 v2 4 14 GB $- $- $-
D4 v2 8 28 GB $- $- $-
D5 v2 16 56 GB $- $- $-
*Customers will continue to see one line item on their bill for Total Price

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimization, and request a custom proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative, or an Azure partner.

Explore your options

Additional resources

HDInsight details

Learn more about HDInsight features and capabilities.

Pricing calculator

Estimate your expected monthly costs for using any combination of Azure products.

SLA

Review the Service Level Agreement for this product.

Documentation

Review technical tutorials, videos, and more HDInsight resources.

Frequently asked questions

  • HDInsight deploys different number of nodes for each cluster type. Within a given cluster type, there are different roles for the various nodes, which allow a customer to size those nodes in a given role appropriate to the details of their workload. For example, a Hadoop cluster can have its worker nodes provisioned with a large amount of memory if the type of analytics being performed are memory intensive.

    Hadoop clusters for HDInsight are deployed with two roles:

    • Head node (2 nodes)
    • Data node (at least 1 node)

    HBase clusters for HDInsight are deployed with three roles:

    • Head servers (2 nodes)
    • Region servers (at least 1 node)
    • Master/Zookeeper nodes (3 nodes)

    Storm clusters for HDInsight are deployed with three roles:

    • Nimbus nodes (2 nodes)
    • Supervisor servers (at least 1 node)
    • Zookeeper nodes (3 nodes)

    Spark clusters for HDInsight are deployed with three roles:

    • Head node (2 nodes)
    • Worker node (at least 1 node)
    • Zookeeper nodes (3 nodes) (Free for A1 zookeepers)

    The use of R-Server will incur one edge node in addition to the cluster deployment architecture.

  • We charge for the number of minutes your cluster is running, rounded to the nearest minute, not hour.

  • If you run a cluster for 100 hours in US East with two D13 v2 head nodes and three D12 v2 data nodes, the billing would be the following in the two scenarios:

    • On a Standard HDInsight cluster—100 hours x (2 x $-/hour + 3 x $-/hour) = $-
    • On a Standard HDInsight cluster with Enterprise Security Package—100 hours x (2 x $-/hour + 3 x $-/hour) + 100 hours x (2 x 8 + 3 x 4 + 3 x 2) x $-/core-hour = $-
  • In order to stop an HDInsight cluster, you must delete the cluster. By default, permanent data generated from HDInsight cluster should be written to Azure Blob Storage, Azure Data Lake Storage Gen1, or Azure Data Lake Storage Gen2 so the data is not impacted by the deletion. If you want to preserve your Hive metadata (tables and schemas) you should provision a cluster with an external metadata store. You can find more details in this documentation.

  • There are a number of options to reduce the costs:

  • The number of data nodes will vary depending on your needs. With the elasticity available in Azure cloud services, you can try a variety of cluster sizes to determine your own optimal mix of performance and cost, and only pay for what you use at any given time. Clusters can also be scaled on demand to grow and shrink to match the requirements of your workload.

  • Each subscription has a default limit on how many HDInsight data nodes can be created. If you need to create a larger HDInsight cluster or multiple HDInsight clusters that together exceed your current subscription maximum, you can request that your subscription's billing limits be increased. Please open a support ticket with Support Type = Billing. Depending on the maximum nodes per subscription that you request, you may be asked for additional information that will allow us to optimize your deployment(s).

  • To estimate the cost of clusters of various sizes, try the Azure Calculator.

Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.

Get free cloud services and a $200 credit to explore Azure for 30 days.