HDInsight pricing

Azure HDInsight offers fully managed and supported 100 percent Apache Hadoop®, Spark, HBase and Storm clusters. You can get up and running quickly on any of these workloads with a few clicks and within a few minutes, without buying hardware or hiring specialised operations teams typically associated with big data infrastructure.

HDInsight clusters consist of a set of nodes. Customers will be billed for the usage of their nodes for the duration of the cluster’s life. Billing starts once a cluster is created and stops when the cluster is deleted and is prorated per minute. Choose between standard or premium cluster. Premium clusters are available for an additional surcharge on top of the standard price. Premium clusters are in public preview and the price below reflects 50 percent of general availability price.

R Server clusters are available for an additional surcharge. HDInsight Kafka is under limited Public Preview. There is no additional surcharge for HDInsight Kafka.

Azure Cloud Data Platform

HDInsight Feature Standard
Base Price/Node-hour
Premium
Base Price/Node-hour + $-/Core-hour Preview
Unlimited Scale
Elastic scale
Easy provisioning
High availability
SLA

Enterprise Readiness

HDInsight Feature Standard
Base Price/Node-hour
Premium
Base Price/Node-hour + $-/Core-hour Preview
Basic Monitoring
Hadoop versions upgrades and patching
Encryption of data at rest
Secure Gateway and Zookeeper Nodes
Ranger (secure Hadoop) + AD integration

Type of Clusters

HDInsight Feature Standard
Base Price/Node-hour
Premium
Base Price/Node-hour + $-/Core-hour Preview
Hadoop (Hive, Pig, Storm)
Hadoop with LLAP (interactive querying)
HBase for NoSQL Database
Storm for real-time stream analytics
Spark for in-memory SQL querying, ML and real-time stream analytics
R Server for parallelised machine learning

How pricing works

Hadoop, HBase, Storm, Spark R-Server Kafka
Standard Cluster Base Price/Node-hour Base Price/Node-hour + $-/Core-hour Base Price/Node-hour + $-/Core-hour + Storage1
Premium Cluster Preview Base Price/Node-hour + $-/Core-hour
1 Kafka requires managed disks. Choose between Standard Managed Disks or Premium Managed Disks. Managed disk prices apply.

Base Price/Node-hour

With HDInsight, full Hadoop support is included in your Azure support agreement. An Azure support subscription provides expert technical assistance from Microsoft for organisations implementing Hadoop, as well as for all other services in Azure. It is also backed by Hortonworks, the leading contributor to Apache Hadoop open source. Hortonworks employs some of the original developers and architects of Hadoop who have contributed more than half of all code to Hadoop. This makes Microsoft more qualified to support your deployment through expert technical assistance and enables us to fix and commit code back to open source Hadoop.

Other Azure services associated with HDInsight, such as Storage and Data Transfers, are billed separately using standard rates. To estimate your bill, try the Azure Calculator.

Please note that HDInsight writes log files in the storage account associated with the cluster. These log files are available even after the cluster is dropped and can be deleted at any time.

Base Price for A-Series General purpose nodes for HDInsight

The use of A1, A2, A3 nodes is limited to Zookeeper nodes and Headnodes. A-series is not available for data or worker nodes.

Instance Cores RAM Disk Sizes Base Price / Node
A1 1 1.75 GB 70 GB $-
A2 2 3.5 GB 135 GB $-
A3 4 7 GB 285 GB $-
A4 8 14 GB 605 GB $-
A5 2 14 GB 135 GB $-
A6 4 28 GB 285 GB $-
A7 8 56 GB 605 GB $-

Base Price for A-Series Compute Intensive

Instance Cores RAM Disk Sizes Base Price / Node
A10 8 56 GB 382 GB $-
A11 16 112 GB 382 GB $-

Base Price for Dv2-Series Optimised nodes: 35% faster than D-series, latest generation of CPU

Instance Cores RAM Disk Sizes Base Price / Node
D1 v2 1 3.5 GB 50 GB $-
D2 v2 2 7 GB 100 GB $-
D3 v2 4 14 GB 200 GB $-
D4 v2 8 28 GB 400 GB $-
D5 v2 16 56 GB 800 GB $-
D11 v2 2 14 GB 100 GB $-
D12 v2 4 28 GB 200 GB $-
D13 v2 8 56 GB 400 GB $-
D14 v2 16 112 GB 800 GB $-

Support & SLA

  • Free billing and subscription management support.
  • Flexible support plans starting at $29/month. Find a plan.
  • Guaranteed 99.9% connectivity for multiple instances. Read the SLA.

FAQ

  • HDInsight deploys different number of nodes for each cluster type. Within a given cluster type, there are different roles for the various nodes, which allow a customer to size those nodes in a given role appropriate to the details of their workload. For example, a Hadoop cluster can have its worker nodes provisioned with a large amount of memory if the type of analytics being performed are memory intensive.

    Hadoop clusters for HDInsight are deployed with two roles:

    • Head node (2 nodes)
    • Data node (at least 1 node)

    HBase clusters for HDInsight are deployed with three roles:

    • Head servers (2 nodes)
    • Region servers (at least 1 node)
    • Master/Zookeeper nodes (3 nodes)

    Storm clusters for HDInsight are deployed with three roles:

    • Nimbus nodes (2 nodes)
    • Supervisor servers (at least 1 node)
    • Zookeeper nodes (3 nodes)

    Spark clusters for HDInsight are deployed with three roles:

    • Head node (2 nodes)
    • Worker node (at least 1 node)
    • Zookeeper nodes (3 nodes) (Free for A1 Zookeepers)

    The use of R-Server will incur one edge node in addition to the cluster deployment architecture.

  • We charge for the number of minutes your cluster is running, rounded to the nearest minute, not hour.

  • If you run a cluster for 100 hours in US East with 2 head nodes D13 v2, 3 data nodes D12 v2 and 3 zookeepers D11, the billing would be the following in the two scenarios:

    • Use HDInsight Standard on a core HDInsight cluster—100 hours x (2 x $1.368/hour + 3 x $0.76/hour + 3 x $0.38/hour) = $615.6
    • Use HDInsight Premium on a core HDInsight cluster with R-Server—100 hours x (2 x $1.368/hour + 3 x $0.76/hour + 3 x $0.38/hour) + 100 hours x (2 x 8 + 3 x 4 + 3 x 2) x $-/Core-hour = $751.6
  • In order to stop an HDInsight cluster, you must delete the cluster. By default, all data an HDInsight cluster operates on resides in Azure Blob Storage, so data will not be impacted by this. If you want to preserve your Hive metadata (tables, schemas) you should provision a cluster with an external metadata store. You can find more details in this documentation.

  • The number of data nodes will vary depending on your needs. With the elasticity available in Azure cloud services, you can try a variety of cluster sizes to determine your own optimal mix of performance and cost and only pay for what you use at any given time. Clusters can also be scaled on demand to grow and shrink to match the requirements of your workload.

  • Each subscription has a default limit on how many HDInsight data nodes can be created. If you need to create a larger HDInsight cluster or multiple HDInsight clusters that together exceed your current subscription maximum, you can request that your subscription's billing limits be increased. Please open a support ticket with Support Type = Billing. Depending on the maximum nodes per subscription that you request, you may be asked for additional information that will allow us to optimise your deployment(s).

  • To estimate the cost of clusters of various sizes, try the Azure Calculator.

Resources

Estimate your monthly costs for Azure services

Review Azure pricing frequently asked questions

Learn more about HDInsight

Review technical tutorials, videos, and more resources

Learn and build with $200 in credit and keep going for free

Free account