Solution architecture: On-premises HPC implementation bursting to Azure

High performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute or run as a 100% cloud native solution. This HPC solution can extend its computational capacity by leveraging the compute-intensive instances of Virtual Machines running in Azure and accessed via Express Route or VPN.

This solution is built on the Azure managed services: Virtual Machines, Virtual Network, VPN Gateway, ExpressRoute and Storage. These services run in a high-availability environment, patched and supported, allowing you to focus on your solution instead of the environment they run in.

On-premises HPC implementation bursting to Azure A diagram showing the solution architecture of an on-premises, high performance computing implementation bursting to Azure, built on the Azure managed services Virtual Machines, Virtual Network, VPN Gateway, ExpressRoute and Storage. HPC Compute Cluster OR OR VM Scale Set Virtual Machines Virtual Network Azure On-Premise Storage VPN Express Route Cluster Nodes Cluster Head Node

Implementation guidance

Products Documentation

HPC head node

The HPC head node runs on-premises or within Azure as a virtual machine.
Azure virtual machines of size A8/A9 provide the HPC compute nodes, running on Windows or the Linux operating system.
RDMA networking available with A8 and A9 instances is used to achieve high bandwidth and microsecond latencies between compute nodes.
Storage nodes can also be run within virtual machines.

Virtual Machines

Availability sets ensure that the application is available and resilient to updates and hardware faults..
Virtual machines communicating via RDMA are placed within the same availability set.

Virtual Network

Virtual Network provides IP connectivity between the head node, compute nodes and storage nodes.

VPN Gateway

A virtual private gateway provides the Azure endpoint for a VPN connection between an on-premises VPN device and the Azure-based virtual network.
This enables secure connectivity across the Internet between an on-premises head node or cluster and the compute nodes running in Azure.

ExpressRoute

ExpressRoute provides secure, reliable, high-speed and low-latency connectivity between on-premises resources and Azure resources using private connections that don’t traverse the public Internet.

Storage

Azure Storage blobs store the disks backing the virtual machines and provides long-term storage of unstructured data and executable files used by the HPC application.

Related solution architectures

Big compute solutions as a service

High performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute or run as a 100% cloud native solution. This HPC solution is implemented with Azure Batch, which provides job scheduling, auto-scaling of compute resources and execution management as a platform service (PaaS) that reduces HPC infrastructure code and maintenance.

Learn more