Solution architecture: Big compute solutions as a service

High-performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute, or run as a 100% cloud-native solution. This HPC solution is implemented with Azure Batch, a platform service (PaaS) that provides job scheduling, auto-scaling of compute resources, and execution management, which reduces HPC infrastructure code and maintenance.

This solution is built on the Azure managed services Virtual Machines, Storage, and Batch. These services run in a high-availability environment and are patched and supported, so you can focus on your solution instead of the environment they run in.
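To make the idea of auto-scaling compute resources concrete, here is a minimal sketch of the kind of decision a scheduler makes when sizing a pool from its task backlog. It is an illustration only, not the Batch autoscale formula language or SDK; the function name `target_node_count` and its parameters are hypothetical.

```python
import math


def target_node_count(pending_tasks: int, tasks_per_node: int,
                      min_nodes: int = 0, max_nodes: int = 100) -> int:
    """Return how many compute nodes a pool should hold for the current backlog.

    Hypothetical helper: the names and the sizing rule are assumptions made
    for illustration, not part of any Azure API.
    """
    if tasks_per_node <= 0:
        raise ValueError("tasks_per_node must be positive")
    # Enough nodes to run every pending task at once, rounded up.
    needed = math.ceil(pending_tasks / tasks_per_node)
    # Clamp to the limits the pool is allowed to grow or shrink to.
    return max(min_nodes, min(max_nodes, needed))


if __name__ == "__main__":
    for backlog in (0, 10, 250, 5000):
        nodes = target_node_count(backlog, tasks_per_node=4, max_nodes=1000)
        print(f"{backlog} pending tasks: {nodes} nodes")
```

The clamp to `min_nodes` and `max_nodes` reflects the usual trade-off: a floor keeps some capacity warm for new work, and a ceiling bounds cost when the backlog spikes.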

[Figure: Big compute solutions as a service. A diagram showing the solution architecture of a big compute solution as a service, built on the Azure managed services Virtual Machines, Storage and Batch. Components shown: Client App, Web App, Batch, Pool, Virtual Machines, Storage.]

Implementation guidance


Virtual Machines

Azure virtual machines of size A8 or A9 provide the HPC compute nodes, running either Windows or Linux.
The RDMA networking available with A8 and A9 instances provides high bandwidth and microsecond latencies between compute nodes.
Storage nodes can also run in virtual machines.

Storage

Azure Storage blobs store the disks backing the virtual machines and provide long-term storage for the unstructured data and executable files used by the HPC application.

Batch

Azure Batch schedules the jobs, manages the pool of compute nodes, and sends large amounts of data to the compute engine running in the cloud.
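The job and pool relationship can be sketched locally as a fan-out of independent tasks over a fixed number of workers. This is a stand-in under stated assumptions, not the Batch SDK: `ThreadPoolExecutor` plays the role of the pool, the names `Task`, `make_tasks`, `run_task`, and `run_job` are hypothetical, and `run_task` is a placeholder for the real HPC executable.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    task_id: str
    input_name: str  # hypothetical name of an input file held in storage


def make_tasks(input_names):
    """Split a job into one task per input file."""
    return [Task(f"task-{i:04d}", name) for i, name in enumerate(input_names)]


def run_task(task):
    """Placeholder for the HPC executable: returns the length of the input name."""
    return task.task_id, len(task.input_name)


def run_job(tasks, pool_size):
    """Run all tasks on a local worker pool and collect results by task id."""
    with ThreadPoolExecutor(max_workers=pool_size) as pool:
        return dict(pool.map(run_task, tasks))


if __name__ == "__main__":
    job_tasks = make_tasks(["frame-001.dat", "frame-002.dat", "frame-003.dat"])
    print(run_job(job_tasks, pool_size=2))
```

In a managed service the pool would be a set of virtual machines and the inputs would be read from storage; the sketch only shows how one job decomposes into tasks that the pool works through in parallel.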

Related solution architectures

On-premises HPC implementation bursting to Azure

High-performance computing (HPC) applications can scale to thousands of compute cores, extend on-premises big compute, or run as a 100% cloud-native solution. This HPC solution can extend its computational capacity by using the compute-intensive instances of Virtual Machines running in Azure, accessed over ExpressRoute or VPN.
