Configures Spark cluster & a single node Cassandra on CentOS

Last updated: 2/28/2016

Cost estimate

$675.3588000000

Estimated monthly cost

Understand how this was calculated and customize to your needs

Pricing calculator

This project configures a Spark cluster (1 master and n-slave nodes) and a single node Cassandra on Azure using CentOS 7.x. The base image starts with CentOS 7.1 and it is updated to the latest version as part of the provisioning steps.

This Azure Resource Manager template was created by a member of the community and not by Microsoft. Each Resource Manager template is licensed to you under a license agreement by its owner, not Microsoft. Microsoft is not responsible for Resource Manager templates provided and licensed by community members and does not screen for security, compatibility, or performance. Community Resource Manager templates are not supported under any Microsoft support program or service, and are made available AS IS without warranty of any kind.

Parameters

Parameter Name Description
storageMasterType Storage type that is used for master Spark node. This storage account is used to store VM disks.
storageSlaveType Storage type that is used for each of the slave Spark node. This storage account is used to store VM disks.
storageCassandraType Storage type that is used for Cassandra. This storage account is used to store VM disks.
vmMasterVMSize VM size for master Spark node. This VM can be sized smaller.
vmNumberOfSlaves Number of VMs to create to support the slaves. Each slave is created on it's own VM. Minimum of 2 & Maximum of 200 VMs.
vmSlaveVMSize VM size for slave Spark nodes. This VM should be sized based on workloads.
vmCassandraVMSize VM size for Cassandra node. This VM should be sized based on workloads.
vmAdminUserName Specific an admin username that should be used to login to the VM.
vmAdminPassword Specific an admin password that should be used to login to the VM.

Use the template

PowerShell

New-AzureRmResourceGroupDeployment -Name <deployment-name> -ResourceGroupName <resource-group-name> -TemplateUri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-and-cassandra-on-centos/azuredeploy.json
Install and configure Azure PowerShell

Command line

azure config mode arm
azure group deployment create <my-resource-group> <my-deployment-name> --template-uri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/spark-and-cassandra-on-centos/azuredeploy.json
Install and Configure the Azure Cross-Platform Command-Line Interface