Solution architecture: HPC cluster deployed in the cloud

High-performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute or run as a 100% cloud-native solution. This HPC solution, including the head node, compute nodes and storage nodes, runs in Azure with no hardware infrastructure to maintain.

This solution is built on the Azure-managed services: Virtual Machine Scale Sets, Virtual Network and Storage. These services run in a high-availability environment that is patched and supported, allowing you to focus on your solution instead of the environment they run in.

HPC cluster deployed in the cloud A diagram showing the solution architecture of a high-performance computing cluster deployed in the cloud, built on the Azure-managed services Virtual Machine Scale Sets, Virtual Network and Storage. OR Availability Set ARM template Script file ClusterHead Node Virtual Machines VM Scale Set RDMA Network A8, A9, and H SeriesVirtual Machines Storage Virtual Network

Implementation guidance

Products Documentation

HPC head node

The HPC head node runs on premises or within Azure as a virtual machine.
Size A8/A9 Azure virtual machines provide HPC compute nodes running on Windows or Linux operating systems.
RDMA networking, available with A8 and A9 instances, is used to achieve high bandwidth and microsecond latencies between compute nodes.
Storage nodes can also be run within virtual machines.

Virtual Machine Scale Sets

Availability sets ensure that the application is available and resilient to updates and hardware faults.
Virtual machines communicating via RDMA are placed within the same availability set.

Virtual Network

Virtual Network provides IP connectivity between the head node, compute nodes and storage nodes.

Storage

Azure Storage blobs store the disks that back the virtual machines and provide long-term storage of unstructured data and executable files used by the HPC application.

Azure Resource Manager templates

Resource Manager templates or script files are used to deploy your application to the HPC environment.

Related solution architectures

On-premises HPC implementation bursting to Azure

High-performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute or run as a 100% cloud-native solution. This HPC solution can extend its computational capacity by leveraging the compute-intensive instances of Virtual Machines running in Azure and accessed via Express Route or a VPN.

Learn more

Big compute solutions as a service

High-performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute or run as a 100% cloud-native solution. This HPC solution is implemented with Azure Batch, which provides job scheduling, auto-scaling of compute resources and execution management of platforms as a service (PaaS) that reduces HPC infrastructure code and maintenance.

Learn more