-
Microsoft’s cloud supply chain is essential to deliver the infrastructure—servers, storage, and networking gear—that enables cloud reliability and growth.
-
Advancing Azure Virtual Machine availability transparency
Now, in addition to getting a fast notification when a VM’s availability is impacted, customers can expect a root cause to be added at a later point once our automated Root Cause Analysis (RCA) system identifies the failing Azure platform component that led to the VM failure. -
Advancing application reliability with the Azure Well-Architected Framework
We created the Azure Well-Architected Framework to help improve the quality of your workloads, and reliability is one of its five core pillars so for the latest post in our series, I have asked Cloud Advocate David Blank-Edelman to run through how best to approach using the framework to guide your conversations and design decisions in this space. -
Advancing resiliency threat modeling for large distributed systems
All service engineering teams in Azure are already familiar with postmortems as a tool for better understanding what went wrong, how it went wrong, and the customer impact of the related outage. -
Advancing safe deployment with AIOps—introducing Gandalf
The continuous monitoring of health metrics is a fundamental part of this process, and this is where AIOps plays a critical role. -
Advancing in-datacenter critical environment infrastructure availability
There are many factors that can affect critical environment infrastructure availability—the reliability of the infrastructure building blocks, the controls during the datacenter construction stage, effective health monitoring and event detection schemes, a robust maintenance program, and operational excellence to ensure that every action is taken with careful consideration of related risk implications. -
Advancing Azure business continuity management
If you ask three people what a service is, you may get three different answers. -
Learn about the latest innovations: Inside Azure Datacenter Architecture
At Microsoft Ignite, I presented the “Inside Azure Datacenter Architecture” session to give a tour of the latest innovations around how Azure enables intelligent, modern, and innovative applications at scale in the cloud, on-premises, and on the edge. -
Azure and AMD announce landmark in confidential computing evolution
Even before the pandemic accelerated digital transformation globally, the scalability and security advantages offered by Microsoft Azure prompted organizations large and small to migrate their data, applications, and workloads from on-premises data centers to the cloud. -
Advancing failure prediction and mitigation—introducing Narya
Project Narya is a holistic, end-to-end prediction and mitigation service—named after the “ring of fire” from Lord of the Rings, known to resist the weariness of time. -
Announcing preview of Azure Trusted Launch for virtual machines
Persistent threats like bootkits and rootkits are sophisticated malware types that run with the same kernel-mode privileges as the operating system they infect. -
Advancing global network reliability through intelligent software—part 2 of 2
In our two-part series on advancing global network reliability through intelligent software, we explain how we’ve approached our network design, and how we’re constantly working to improve both reliability and performance.