Spark on Ubuntu VMs

Last updated: 4/26/2015

This template creates Spark streaming replication from one master to one or more slaves each configured with multiple striped data disks. The database servers are deployed into a private subnet with an optional externally accessible jumpbox.

This Azure Resource Manager template was created by a member of the community and not by Microsoft. Each Resource Manager template is licensed to you under a license agreement by its owner, not Microsoft. Microsoft is not responsible for Resource Manager templates provided and licensed by community members and does not screen for security, compatibility, or performance. Community Resource Manager templates are not supported under any Microsoft support program or service, and are made available AS IS without warranty of any kind.

Parameters

Parameter Name Description
storageAccountNamePrefix Unique name of the new storage account that will be created to store virtual machine VHDs
domainName Domain name of the public jumpbox
adminUsername Virtual machine administrator username
adminPassword Virtual machine administrator password
tshirtSize T-shirt size of the Spark deployment
sparkversion Version of Spark
jumpbox The flag allowing to enable or disable provisioning of the jumpbox VM that can be used to access the Spark environment
virtualNetworkName Virtual network name

Use the template

PowerShell

New-AzureRmResourceGroupDeployment -Name <deployment-name> -ResourceGroupName <resource-group-name> -TemplateUri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-ubuntu-multidisks/azuredeploy.json
Install and configure Azure PowerShell

Command line

azure config mode arm
azure group deployment create <my-resource-group> <my-deployment-name> --template-uri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-ubuntu-multidisks/azuredeploy.json
Install and Configure the Azure Cross-Platform Command-Line Interface