Wednesday, July 7, 2021
All service engineering teams in Azure are already familiar with postmortems as a tool for better understanding what went wrong, how it went wrong, and the customer impact of the related outage. For today’s post in our Advancing Reliability blog series, we share insights into our journey as we work towards advancing our postmortem and resiliency threat modeling processes.
Wednesday, June 30, 2021
The continuous monitoring of health metrics is a fundamental part of this process, and this is where AIOps plays a critical role. In the post that follows, we introduce how AI and machine learning are used to empower DevOps engineers, monitor the Azure deployment process at scale, detect issues early, and make rollout or rollback decisions based on impact scope and severity.
Monday, June 7, 2021
There are many factors that can affect critical environment infrastructure availability—the reliability of the infrastructure building blocks, the controls during the datacenter construction stage, effective health monitoring and event detection schemes, a robust maintenance program, and operational excellence to ensure that every action is taken with careful consideration of related risk implications.
Wednesday, March 24, 2021
If you ask three people what a service is, you may get three different answers. At Microsoft, we define a service (business process or technology) as a means of delivering value to customers (first or third party) by facilitating outcomes customers want to achieve.
Monday, March 22, 2021
At Microsoft Ignite, I presented the “Inside Azure Datacenter Architecture” session to give a tour of the latest innovations around how Azure enables intelligent, modern, and innovative applications at scale in the cloud, on-premises, and on the edge.
Monday, March 15, 2021
Even before the pandemic accelerated digital transformation globally, the scalability and security advantages offered by Microsoft Azure prompted organizations large and small to migrate their data, applications, and workloads from on-premises data centers to the cloud.
Thursday, March 11, 2021
Project Narya is a holistic, end-to-end prediction and mitigation service—named after the "ring of fire" from Lord of the Rings, known to resist the weariness of time.
Monday, March 8, 2021
Persistent threats like bootkits and rootkits are sophisticated malware types that run with the same kernel-mode privileges as the operating system they infect.
Monday, November 16, 2020
In our two-part series on advancing global network reliability through intelligent software, we explain how we’ve approached our network design, and how we’re constantly working to improve both reliability and performance.
Monday, November 9, 2020
In our two-part series on advancing global network reliability through intelligent software, we explain how we’ve approached our network design, and how we’re constantly working to improve both reliability and performance.