{"id":31817,"date":"2024-02-22T08:00:00","date_gmt":"2024-02-22T16:00:00","guid":{"rendered":""},"modified":"2024-07-22T16:53:27","modified_gmt":"2024-07-22T23:53:27","slug":"advancing-microsoft-azure-resilience-with-chaos-studio","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/","title":{"rendered":"Advancing Microsoft Azure resilience with Chaos Studio"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><em>&#8220;In a <a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-resilience-through-chaos-engineering-and-fault-injection\/\">previous blog post<\/a> in this series, we talked about using chaos engineering and fault injection techniques to validate the resilience of your cloud applications.\u202fChaos testing helps increase confidence in your applications by finding and fixing resiliency issues before they affect customers and streamlining your incident response by reducing or avoiding downtime, data loss, and customer dissatisfaction.\u202fTo enable this, we <a href=\"http:\/\/aka.ms\/AzureChaosStudio\">launched a new platform<\/a> for resilience validation through chaos testing\u2014Azure Chaos Studio. As of November 1, 2023, Chaos Studio is now generally available and ready to use in <a href=\"https:\/\/azure.microsoft.com\/en-us\/explore\/global-infrastructure\/products-by-region\/?products=chaos-studio\">17 production regions<\/a>. I\u2019ve asked Chris Ashton, Principal Program Manager from the Chaos Studio Engineering team to share more on when it\u2019s best to implement the key features that support reliability of your applications.&#8221;<strong> \u2014 Mark Russinovich, CTO, Azure<\/strong>.<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"design-and-implement-validate-and-measure\">Design and implement, validate and measure\u202f<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Design for failure<\/strong>.<strong> <\/strong>The first step in building a resilient application is to start with the <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/well-architected\/\" target=\"_blank\" rel=\"noreferrer noopener\">Microsoft Azure Well-Architected Framework<\/a>\u202fand leverage the guidance to architect an application that is designed to handle failure. Build resilience into your application through the use of availability zones, region pairing, backups, and other recommended techniques. Incorporate Azure Monitor to enable observation of your application\u2019s health. Establish health measures for your application and track key metrics like Service Level Objective (SLO), Recovery Time Objective (RTO), Recovery Point Objective (RPO), and other metrics that are meaningful for your application and business. Before deploying your application to production for customer use, however, you want to verify that it actually handles disruptive conditions as expected and that it is truly resilient. This is where chaos engineering and <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/chaos-studio\/\">Microsoft Azure Chaos Studio<\/a> come in.\u202f<\/p>\n\n\n<div class=\"wp-block-msxcm-cta-block theme-dark\" data-moray data-bi-an=\"CTA Block\">\n\t<div class=\"card d-block mx-ng mx-md-0\">\n\t\t<div class=\"row no-gutters bg-gray-800 text-white\">\n\n\t\t\t\t\t\t\t<div class=\"col-md-4\">\n\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"576\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/01\/MSFT_Blog_Azure_Jan_24_Template_20240124_V1-1024x576.webp\" class=\"card-img img-object-cover\" alt=\"a man standing in front of a computer screen\" srcset=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/01\/MSFT_Blog_Azure_Jan_24_Template_20240124_V1-1024x576.webp 1024w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/01\/MSFT_Blog_Azure_Jan_24_Template_20240124_V1-300x169.webp 300w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/01\/MSFT_Blog_Azure_Jan_24_Template_20240124_V1-768x432.webp 768w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/01\/MSFT_Blog_Azure_Jan_24_Template_20240124_V1.webp 1280w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/>\t\t\t\t<\/div>\n\t\t\t\n\t\t\t<div class=\"d-flex col-md\">\n\t\t\t\t<div class=\"card-body align-self-center p-4 p-md-5\">\n\t\t\t\t\t\n\t\t\t\t\t<h2>Azure Chaos Studio<\/h2>\n\n\t\t\t\t\t<div class=\"mb-3\">\n\t\t\t\t\t\t<p>Improve application resilience with chaos engineering and testing<\/p>\n\t\t\t\t\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<div class=\"link-group\">\n\t\t\t\t\t\t\t<a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/chaos-studio\/\" class=\"btn btn-link text-decoration-none p-0 text-light-blue\" target=\"_blank\">\n\t\t\t\t\t\t\t\t<span>Explore the benefits<\/span>\n\t\t\t\t\t\t\t\t<span class=\"glyph-append glyph-append-chevron-right glyph-append-xsmall\"><\/span>\n\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t<\/div>\n\n\t\t\t\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Chaos engineering<\/strong> is the practice of injecting faults into an application to validate its resilience to the real-world outage scenarios it will encounter in production. Chaos engineering is more than testing\u2014it allows you to validate architecture choices, configuration settings, code quality, and monitoring components, as well as your incident response process. Chaos engineering is best applied by using the scientific method: <\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Form a hypothesis<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Perform fault injection experiments to validate it<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Analyze the results<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Make changes<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Repeat<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Chaos validation<\/strong> can be added to automated release pipeline validation or can be performed manually as a drill event, often called a &#8220;game day.&#8221; Adding chaos to your continuous integration (CI), continuous delivery (CD), and continuous validation (CV) pipeline allows you to gate code flow based on the outcome, gives confidence in the ability to handle nominal conditions, and allows you to continually evaluate the resilience of new code in an ever-changing cloud environment. Chaos can also be combined with load, end-to-end, and other test cases to augment their coverage. Chaos drills and game days can be used less frequently to validate more rare and extreme outage scenarios and to prove disaster recovery (DR) capabilities.\u202f<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Chaos testing<\/strong> is used in many organizations in a variety of ways. Some teams perform monthly drill events, others have added automated Chaos to release pipeline automation, and some do both. Usually, the purpose of drill events is to validate resilience to a specific real-world scenario, such as AAD or Domain Name System (DNS) going down, or to prove Business Continuity and Disaster Recovery (BCDR) compliance. Aspects of drills can be automated, but they require people to plan, orchestrate, monitor, and analyze the resilience of the system under test.\u202f<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In CI\/CD release pipeline automation, the goal is to fully automate resilience validation and catch defects early. Based on the results, many teams block production deployment if their chaos validation fails. Some teams have chaos testing success metrics they track for \u201cresiliency regressions caught\u201d and \u201cincidents prevented.\u201d On the Chaos Studio team, we perform scenario-focused drills against the different microservices that make up the product. We also use chaos testing as a way to train new on-call engineers. In doing so, engineers can see the impact of a real issue and learn the steps of monitoring, analyzing, and deploying a fix in a safe environment without the pressure to fix a customer-impacting issue during an actual outage. When a real issue does arise, they are better equipped to deal with it with confidence.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"inside-microsoft-azure-chaos-studio\">Inside Microsoft Azure Chaos Studio<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Chaos Studio <\/strong>is Microsoft\u2019s solution to<strong> <\/strong>help you measure, understand, improve, and maintain the resilience of your application through hypothesis-driven chaos experiments. Chaos Studio is deeply integrated with Azure to provide safe chaos validation at scale.<\/p>\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/Azure-Chaos-Studio.webp\" alt=\"Diagram of the Chaos Studio microservices and how they interact with a customer application, Azure services, Azure Monitor, and Azure Load Testing.\" class=\"wp-image-31834 webp-format\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/Azure-Chaos-Studio.webp\"><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Chaos Studio provides:\u202f<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">A fully managed service to validate Microsoft Azure application and service resilience.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Deep Azure integration, including an Azure Portal user interface, Azure Resource Manager compliant REST APIs, and integration with <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/azure-monitor\/overview\" target=\"_blank\" rel=\"noreferrer noopener\">Azure Monitor<\/a> and <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/load-testing\/overview-what-is-azure-load-testing\" target=\"_blank\" rel=\"noreferrer noopener\">Azure Load Testing<\/a>\u2014all of which enable manual and automated creation, provisioning, and execution of fault injection experiments.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">An expanding library of common resource pressure and dependency disruption faults and actions that work with your Azure infrastructure as a service (IaaS) and Azure platform as a service (PaaS) resources.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Advanced workflow orchestration of parallel and sequential fault actions that enables simulation of real-world disruption and outage scenarios.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Safeguards that minimize the impact radius and enable control of who performs experiments and in what environments.\u202f<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">A <strong>chaos experiment<\/strong> is where all the action happens. There are several key components of a chaos experiment:\u202f<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Your application to be validated. This must be deployed to a test environment, ideally one that is reflective of your production environment. While this could be your production environment, we recommend testing in an isolated environment, at least at first, to minimize potential impact to your customers.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Experiment targets are the Azure resources provisioned and enabled for use in chaos experiments which will have faults applied to them.\u202f<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Fault actions are the orchestrated disruptions and actions to the application and its dependencies and are provided by Chaos Studio. These can be simple resource pressure faults like CPU, memory, and disk pressure, network delays and blocks, or more destructive actions like killing a process, shutting down a virtual machine (VM), causing an Azure Cosmos DB failover, and other activities like a simple delay or starting an Azure Load Testing load test case.&nbsp;<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Traffic is a synthetic workload or actual customer traffic against the application to create\u202fproduction-like customer usage. Users may add synthetic load directly in chaos experiments by leveraging Azure Load Testing fault actions.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Monitoring is used to observe application health and behavior during an experiment.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Real world scenarios<\/strong> can be validated by building experiments that leverage multiple faults at once. Systematic disruption of individual dependencies like Microsoft Azure Storage, SQL Server, or Azure Cache for Redis is very useful, but real value comes when validating real-world outage scenarios like an availability zone outage from a power outage in a datacenter, crush load due to a holiday sales event, tax day, or DNS going down. You can build experiments to regression test the root cause of your last major outage.\u202f<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"chaos-studio-best-practices-and-tips\">Chaos Studio best practices and tips<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Chaos Studio<\/strong> allows you to monitor and improve your applications by providing tight integration with Azure Monitor and your CI\/CD pipelines. By integrating with Azure Monitor, you have a view into the lifecycle of your experiments including in-depth data on timing and the faults and resources targeted by the experiment. This data can live side-by-side with your existing Azure Monitor dashboards or added to your external monitoring dashboards.\u202fBy incorporating Chaos Studio into your CI\/CD pipeline, it allows you to continuously validate the resilience of your system by running chaos experiments as part of your build and deployment process.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To help you get started with your chaos journey, here are a few tips and practices that have helped others:\u202f<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Pilot<\/strong>: Don\u2019t just jump in and start injecting faults. While that can be fun, take a methodical approach and set up a throw-away test environment to practice onboarding targets, creating experiments, setting up monitoring, and running the experiments to figure out how different faults work and how they impact different resources. Once you\u2019re used to the product, spend time to determine how to safely deploy chaos into a broader, production-like test environment.&nbsp;<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><strong>Hypotheses<\/strong>: Formulate resilience hypotheses based on your application architecture and think about the experiments you want to perform, the things you want to validate, and the scenarios you should be resilient to.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><strong>Drill<\/strong>: Pick a hypothesis and plan for a drill event. Line up experiments related to the hypotheses, ensure monitoring is in place, notify other users of the test environment, do a pre-drill health check, and then run your experiment to inject faults. During the drill, monitor your application health. After, conduct a retrospective to analyze results and compare against hypotheses.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><strong>Automation<\/strong>: To further improve resiliency in your software development lifecycle, you can gate your production code flow based on the outcomes of automated Chaos validation.\u202f<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">This should give you a basic understanding of how chaos engineering and Chaos Studio can assist you in enhancing and preserving your application resilience, so that you can confidently launch to production.\u202f<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"discover-the-benefits-of-chaos-studio\">Discover the benefits of Chaos Studio<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">To begin your journey on <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/chaos-studio\/\">Chaos Studio<\/a>, <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/chaos-studio\/\" target=\"_blank\" rel=\"noreferrer noopener\">consult the documentation<\/a> for a summary of concepts and how-to guides. Once you grasp the benefits of chaos testing and Chaos Studio, a crucial next step is to incorporate this into your release pipeline validation.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Microsoft Azure Chaos Studio solution helps you measure, understand, improve, and maintain the resilience of your application through hypothesis-driven chaos experiments.<\/p>\n","protected":false},"author":42,"featured_media":31842,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1456,1482],"tags":[1505,1508],"audience":[3055],"content-type":[1511,1481],"product":[2912,1538,1853,1533,2934],"tech-community":[],"topic":[],"coauthors":[21],"class_list":["post-31817","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-devops","category-management-and-governance","tag-advancing-reliability","tag-azure-well-architected-framework","audience-developers","content-type-best-practices","content-type-thought-leadership","product-azure-cache-for-redis","product-azure-cosmos-db","product-azure-load-testing","product-azure-monitor","product-microsoft-entra-domain-services","review-flag-1680286581-295","review-flag-1-1680286581-825","review-flag-lever-1680286579-649","review-flag-microsofts","review-flag-new-1680286579-546","review-flag-vm-1680286585-143"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2024-02-22T16:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-07-22T23:53:27+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-social.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1116\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Mark Russinovich\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-social.png\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Mark Russinovich\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/mark-russinovich\/\",\"@type\":\"Person\",\"@name\":\"Mark Russinovich\"}],\"headline\":\"Advancing Microsoft Azure resilience with Chaos Studio\",\"datePublished\":\"2024-02-22T16:00:00+00:00\",\"dateModified\":\"2024-07-22T23:53:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\"},\"wordCount\":1546,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp\",\"keywords\":[\"Advancing reliability\",\"Azure Well-Architected Framework\"],\"articleSection\":[\"DevOps\",\"Management and governance\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\",\"name\":\"Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp\",\"datePublished\":\"2024-02-22T16:00:00+00:00\",\"dateModified\":\"2024-07-22T23:53:27+00:00\",\"description\":\"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp\",\"width\":800,\"height\":450,\"caption\":\"Satisfied executive customer working and collaborating with investment advisor on investment decisions using intelligent apps powered by Azure.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Management and governance\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/management-and-governance\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Advancing Microsoft Azure resilience with Chaos Studio\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/b2603da1afac705823964361ce9072c0\",\"name\":\"Kristin Gallagher\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=gb83eb8c5c3f8feea9763b473dabe8524\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=g\",\"caption\":\"Kristin Gallagher\"},\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/kristingallagher\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog","description":"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/","og_locale":"en_US","og_type":"article","og_title":"Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog","og_description":"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2024-02-22T16:00:00+00:00","article_modified_time":"2024-07-22T23:53:27+00:00","og_image":[{"width":1116,"height":628,"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-social.png","type":"image\/png"}],"author":"Mark Russinovich","twitter_card":"summary_large_image","twitter_image":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-social.png","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Mark Russinovich","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/mark-russinovich\/","@type":"Person","@name":"Mark Russinovich"}],"headline":"Advancing Microsoft Azure resilience with Chaos Studio","datePublished":"2024-02-22T16:00:00+00:00","dateModified":"2024-07-22T23:53:27+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/"},"wordCount":1546,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp","keywords":["Advancing reliability","Azure Well-Architected Framework"],"articleSection":["DevOps","Management and governance"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/","name":"Advancing Microsoft Azure resilience with Chaos Studio | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp","datePublished":"2024-02-22T16:00:00+00:00","dateModified":"2024-07-22T23:53:27+00:00","description":"Microsoft Azure Chaos Studio is deeply integrated to provide safe chaos validation at scale. Learn more.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/02\/MSFT_Azure_FEB20_308179_Blog_Blogheader_240219_V01-featured.webp","width":800,"height":450,"caption":"Satisfied executive customer working and collaborating with investment advisor on investment decisions using intelligent apps powered by Azure."},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/advancing-microsoft-azure-resilience-with-chaos-studio\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"Management and governance","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/management-and-governance\/"},{"@type":"ListItem","position":3,"name":"Advancing Microsoft Azure resilience with Chaos Studio"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/b2603da1afac705823964361ce9072c0","name":"Kristin Gallagher","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=gb83eb8c5c3f8feea9763b473dabe8524","url":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=mm&r=g","caption":"Kristin Gallagher"},"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/kristingallagher\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/31817","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/42"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=31817"}],"version-history":[{"count":0,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/31817\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media\/31842"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=31817"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=31817"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=31817"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=31817"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=31817"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=31817"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=31817"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=31817"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=31817"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}