Provisions a Spark Cluster on Ubuntu Virtual Machines

Last updated: 4/6/2015

Cost estimate

$178.5600000000

Estimated monthly cost

Understand how this was calculated and customize to your needs

Pricing calculator

This template creates a Spark cluster on Ubuntu virtual machine image, enables persistence (by default) and applies all well-known optimizations and best practices

This Azure Resource Manager template was created by a member of the community and not by Microsoft. Each Resource Manager template is licensed to you under a license agreement by its owner, not Microsoft. Microsoft is not responsible for Resource Manager templates provided and licensed by community members and does not screen for security, compatibility, or performance. Community Resource Manager templates are not supported under any Microsoft support program or service, and are made available AS IS without warranty of any kind.

Parameters

Parameter Name Description
adminUsername Administrator user name used when provisioning virtual machines
adminPassword Administrator password used when provisioning virtual machines
imagePublisher Image Publisher
imageOffer Image Offer
imageSKU Image SKU
virtualNetworkName The arbitrary name of the virtual network provisioned for the cluster
addressPrefix The network address space for the virtual network
subnetName Subnet name for the virtual network that resources will be provisioned in to
subnetPrefix Address space for the virtual network subnet
sparkVersion The version of the Spark package to be deployed on the cluster (or use 'stable' to pull in the latest and greatest)
sparkClusterName The arbitrary name of the Spark cluster (maps to cluster's configuration file name)
sparkNodeIPAddressPrefix The IP address prefix that will be used for constructing a static private IP address for each node in the cluster
sparkSlaveNodeIPAddressPrefix The IP address prefix that will be used for constructing a static private IP address for each node in the cluster
jumpbox The flag allowing to enable or disable provisioning of the jumpbox VM that can be used to access the Spark nodes
tshirtSize T-shirt size of the Spark cluster

Use the template

PowerShell

New-AzureRmResourceGroupDeployment -Name <deployment-name> -ResourceGroupName <resource-group-name> -TemplateUri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-on-ubuntu/azuredeploy.json
Install and configure Azure PowerShell

Command line

azure config mode arm
azure group deployment create <my-resource-group> <my-deployment-name> --template-uri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-on-ubuntu/azuredeploy.json
Install and Configure the Azure Cross-Platform Command-Line Interface