Cloudera Cluster

Last updated: 3/23/2017

This template deploys a multi-VM Cloudera cluster with one node running Cloudera Manager, two name nodes, and N data nodes.

This Azure Resource Manager template was created by a member of the community and not by Microsoft. Each Resource Manager template is licensed to you under a license agreement by its owner, not Microsoft. Microsoft is not responsible for Resource Manager templates provided and licensed by community members and does not screen for security, compatibility, or performance. Community Resource Manager templates are not supported under any Microsoft support program or service, and are made available AS IS without warranty of any kind.

Parameters

Parameter Name              Description
adminUserName               Admin user name for the VMs
adminPassword               Admin password for the VMs (a mix of upper- and lower-case characters, digits, and symbols)
cmUsername                  User name for Cloudera Manager
cmPassword                  Password for Cloudera Manager (a mix of upper- and lower-case characters, digits, and symbols)
storageAccountSuffix        A label of 3 to 11 alphanumeric characters, inclusive. The final Storage Account name will be [13 random characters][suffix]. Only new storage accounts are supported.
dnsNamePrefix               Unique public DNS name prefix under which the VMs will be exposed
masterStorageAccountType    The type of the Storage Account to be created for master nodes (defaults to Premium_LRS)
workerStorageAccountType    The type of the Storage Account to be created for worker nodes (defaults to Standard_LRS)
virtualNetworkName          The name of the virtual network provisioned for the deployment
vnetNewOrExisting           Indicator for a new or existing virtual network
virtualNetworkRGName        Resource group name for the virtual network. Leave it empty for a new virtual network; otherwise enter the name of the existing resource group.
subnetName                  Subnet name for the virtual network where resources will be provisioned
addressPrefix               Virtual network address CIDR
subnetPrefix                CIDR for the subnet where VMs will be placed
masterNodeIPAddress         IP address for the first master
dataNodeIPOffSetFromMaster  Offset of the first data node's IP address from the master node's. For example, with an offset of 10, if the first master is 10.1.1.1, the first data node is 10.1.1.11.
tshirtSize                  T-shirt size of the Cloudera cluster (Eval, Prod)
numberOfDataNodes           Number of data nodes for Prod (defaults to 3)
vmSize                      The size of the VMs deployed in the cluster (defaults to Standard_DS14)
vmImage                     The OS VM image (defaults to ClouderaCentOS6_7)
company                     Your company
emailAddress                Your email address
businessPhone               Your business phone number
firstName                   Your first name
lastName                    Your last name
jobRole                     Your job role
jobFunction                 Your job function
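
Two of the parameters above carry rules that are easy to check before deploying. The sketch below is a minimal illustration, not part of the template: the function names valid_storage_suffix and first_data_node_ip are hypothetical. It assumes the offset is added to the last octet of the master address, as in the 10.1.1.1 to 10.1.1.11 example; how the template itself computes addresses is not shown here. The suffix check also requires lowercase, since Azure storage account names allow only lowercase letters and digits, up to 24 characters (13 random characters plus an 11-character suffix reaches exactly that limit).

```shell
# Hypothetical pre-deployment checks; neither function is part of the template.

# Succeeds when the suffix is 3 to 11 lowercase letters or digits.
valid_storage_suffix() {
  case "$1" in
    *[!a-z0-9]*) return 1 ;;   # reject any character outside a-z and 0-9
  esac
  [ "${#1}" -ge 3 ] && [ "${#1}" -le 11 ]
}

# Prints the first data node address: the master's last octet plus the offset.
# $1 = masterNodeIPAddress, $2 = dataNodeIPOffSetFromMaster
# Assumption: the offset applies to the last octet only.
first_data_node_ip() {
  octet=$(( ${1##*.} + $2 ))
  if [ "$octet" -gt 254 ]; then
    echo "offset moves the address outside the last octet" >&2
    return 1
  fi
  echo "${1%.*}.${octet}"
}

first_data_node_ip 10.1.1.1 10   # prints 10.1.1.11
```

Running both checks against the intended values catches a rejected storage account name or an out-of-range address before the deployment starts.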

Use the template

PowerShell

New-AzureRmResourceGroupDeployment -Name <deployment-name> -ResourceGroupName <resource-group-name> -TemplateUri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/cloudera-on-centos/azuredeploy.json
See also: Install and configure Azure PowerShell

Command line

azure config mode arm
azure group deployment create <my-resource-group> <my-deployment-name> --template-uri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/cloudera-on-centos/azuredeploy.json
See also: Install and Configure the Azure Cross-Platform Command-Line Interface
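
Parameter values can be kept in a parameters file rather than entered at the prompt. The sketch below writes such a file in the standard Resource Manager deployment-parameters layout. Every value is a placeholder, the file name azuredeploy.parameters.json is only a convention, and the subset of parameters shown is an assumption: check the template for which parameters it actually requires.

```shell
# Write a parameters file with placeholder values (all values are hypothetical).
cat > azuredeploy.parameters.json <<'EOF'
{
  "$schema": "https://schema.management.azure.com/schemas/2015-01-01/deploymentParameters.json#",
  "contentVersion": "1.0.0.0",
  "parameters": {
    "adminUserName": { "value": "<admin-user>" },
    "adminPassword": { "value": "<admin-password>" },
    "cmUsername": { "value": "<cm-user>" },
    "cmPassword": { "value": "<cm-password>" },
    "storageAccountSuffix": { "value": "<suffix>" },
    "dnsNamePrefix": { "value": "<dns-prefix>" },
    "tshirtSize": { "value": "Eval" }
  }
}
EOF

# The file is then passed to the deployment command (not run here).
# The -e (--parameters-file) option is assumed from the classic CLI;
# confirm it with: azure group deployment create --help
# azure group deployment create <my-resource-group> <my-deployment-name> \
#   --template-uri https://raw.githubusercontent.com/azure/azure-quickstart-templates/master/cloudera-on-centos/azuredeploy.json \
#   -e azuredeploy.parameters.json
```

Because the file holds passwords, keep it out of source control.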

More templates by Jason Wang