{"id":47073,"date":"2025-10-13T17:35:00","date_gmt":"2025-10-14T00:35:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/en-us\/blog\/?p=47073"},"modified":"2025-10-15T13:37:37","modified_gmt":"2025-10-15T20:37:37","slug":"accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/","title":{"rendered":"Accelerating open-source infrastructure development for frontier AI at scale"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">In the transition from building computing infrastructure for cloud scale to building cloud and AI infrastructure for frontier scale, the world of computing has experienced tectonic shifts in innovation. Throughout this journey, Microsoft has shared its learnings and best practices, optimizing&nbsp;our cloud infrastructure stack in cross-industry forums such as the Open Compute Project (OCP) Global Foundation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Today, we see that the next phase of cloud infrastructure innovation is poised to be the most consequential period of transformation yet. In just the last year, Microsoft has added more than 2 gigawatts of new capacity and launched the world\u2019s most powerful AI datacenter, which delivers 10x the performance of the world\u2019s fastest supercomputer today. Yet, this is just the beginning.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/datacenters.microsoft.com\/\">Learn more about Microsoft\u2019s&nbsp;global infrastructure<\/a><\/div>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Delivering AI infrastructure at the highest performance and lowest cost requires a systems approach, with optimizations across the stack to drive quality, speed, and resiliency at a level that can provide a consistent experience to our customers. In the quest to supply resilient, sustainable, secure, and widely scalable technology to handle the breadth of AI workloads, we\u2019re embarking on an ambitious new journey: one not just of redefining infrastructure innovation at every layer of execution from silicon to systems, but one of tightly integrated industry alignment on standards that offer a model for global interoperability and standardization.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At this year\u2019s OCP Global Summit, Microsoft is contributing new standards across power, cooling, sustainability, security, networking, and fleet resiliency to further advance innovation in the industry.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"redefining-power-distribution-for-the-ai-era\">Redefining power distribution for the AI era<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">As AI workloads scale globally, hyperscale datacenters are experiencing unprecedented power density and distribution challenges.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Last year, at the OCP Global Summit, we partnered with Meta and Google in the development of Mt. Diablo, a disaggregated power architecture. This year,&nbsp;we\u2019re&nbsp;building on <a href=\"https:\/\/techcommunity.microsoft.com\/blog\/azureinfrastructureblog\/rethinking-power-conversion-and-distribution-for-the-ai-era\/4460759\">this innovation with the next step of our full-stack transformation<\/a> of datacenter power systems:&nbsp;solid-state transformers. Solid-state transformers simplify the power chain with new conversion technologies and protection schemes that can accommodate future rack voltage requirements.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Training large models across thousands of GPUs also introduces variable and intense power draw patterns that can strain the grid. The utility, and traditional power delivery systems. These fluctuations not only risk hardware reliability and operational efficiency but also create challenges across capacity planning and sustainability goals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Together with key industry partners, Microsoft is leading a power stabilization initiative to address this challenge. In a recently published paper with OpenAI and NVIDIA\u2014<a href=\"https:\/\/arxiv.org\/abs\/2508.14318\" target=\"_blank\" rel=\"noreferrer noopener\">Power Stabilization for AI Training Datacenters<\/a>\u2014we address how full-stack innovations spanning rack-level hardware, firmware orchestration, predictive telemetry, and facility integration can smooth power spikes, reduce power overshoot by 40%, and mitigate operational risk and costs to enable predictable, and scalable power delivery for AI training clusters.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This year, at the OCP Global Summit, Microsoft is joining forces with industry partners to launch a dedicated power stabilization workgroup. Our goal is to foster open collaboration across hyperscalers and hardware partners, sharing our learnings from full-stack innovation and inviting the community to co-develop new methodologies that address the unique power challenges of AI training datacenters. By building on the insights from our recently published white paper, we aim to accelerate industry-wide adoption of resilient, scalable power delivery solutions for the next generation of AI infrastructure. <a href=\"https:\/\/aka.ms\/ocp2025powerstabilization\" target=\"_blank\" rel=\"noreferrer noopener\">Read more about our power stabilization efforts<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"cooling-innovations-for-resiliency\">Cooling innovations for resiliency<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">As the power profile for AI infrastructure changes, we are also continuing to rearchitect our cooling infrastructure to support evolving needs around energy consumption, space optimization, and overall datacenter sustainability. Various cooling solutions must be implemented to support the scale of our expansion\u2014as we seek to build new AI-scale datacenters, we are also utilizing Heat Exchanger Unit (HXU)-based liquid cooling to rapidly deploy new AI capacity within our existing air-cooled datacenter footprint.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Microsoft\u2019s next generation HXU is an upcoming OCP contribution that enables liquid cooling for high-performance AI systems in air-cooled datacenters, supporting global scalability and rapid deployment. The modular HXU design delivers 2X the performance of current models and maintains &gt;99.9%&nbsp;cooling service availability for AI workloads. No datacenter modifications are required, allowing seamless integration and expansion. <a href=\"https:\/\/aka.ms\/ocp2025_nextgenHXU\" target=\"_blank\" rel=\"noreferrer noopener\">Learn more about the next generation HXU here.<\/a>&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Meanwhile, we\u2019re continuing to innovate across multiple layers of the stack to address changes in power and heat dissipation\u2014utilizing facility water cooling at datacenter-scale, circulating liquid in&nbsp;closed-loops from server to chiller; and exploring on-chip cooling innovations like microfluidics to efficiently remove heat directly from the silicon.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"unified-networking-solutions-for-growing-infrastructure-demands\">Unified networking solutions for growing infrastructure demands&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Scaling hundreds of thousands of GPUs to operate as a single, coherent system comes with significant challenges to create rack-scale interconnects that can deliver low-latency, high bandwidth fabrics that are both efficient and interoperable. As AI workloads grow exponentially and infrastructure demands intensify, we are exploring networking optimizations that can support these needs. To that end, we have developed solutions leveraging scale-up, scale-out, and Wide Area Network (WAN) solutions to enable large-scale distributed training.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We partner closely with standards bodies, like UEC (Ultra Ethernet Consortium) and UALink, focused on innovation in networking technologies for this critical element of AI systems. We are also driving forward adoption of Ethernet for scale-up networking across the ecosystem and are excited to see the <a href=\"https:\/\/www.opencompute.org\/blog\/introducing-esun-advancing-ethernet-for-scale-up-ai-infrastructure-at-ocp\" target=\"_blank\" rel=\"noreferrer noopener\">Ethernet for Scale-up Networking (ESUN) workstream launch under the OCP Networking Project<\/a>. We look forward to promoting adoption of cutting-edge networking solutions and enabling multi-vendor Ecosystem based on open standards.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"security-sustainability-and-quality-fundamental-pillars-for-resilient-ai-operations\">Security, sustainability, and quality: Fundamental pillars for resilient AI operations<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"defense-in-depth-trust-at-every-layer\">Defense in depth: Trust at every layer<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Our comprehensive approach to scaling AI systems responsibly includes embedding trust and security into every layer of our platform. This year, we are introducing new security contributions that build on our existing body of work in hardware security and introduce new protocols that are uniquely fit to support new scientific breakthroughs that have been accelerated with the introduction of AI:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Building on past years\u2019 contributions and Microsoft\u2019s collaboration with AMD, Google, and NVIDIA, we have further enhanced Caliptra, our open-source silicon root of trust The introduction of Caliptra 2.1 extends the hardware root-of-trust to a full security subsystem. <a href=\"https:\/\/aka.ms\/caliptra2.1\" target=\"_blank\" rel=\"noreferrer noopener\">Learn more about Caliptra 2.1 here<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">We have also added Adams Bridge 2.0 to Caliptra to extend support for quantum-resilient cryptographic algorithms to the root-of-trust.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Finally, we are contributing OCP Layered Open-source Cryptographic Key Management (L.O.C.K)\u2014a key management block for storage devices that secures media encryption keys in hardware. L.O.C.K was developed through collaboration between Google, Kioxia, Microsoft, Samsung,&nbsp;and Solidigm.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"advancing-datacenter-scale-sustainability\">Advancing datacenter-scale sustainability&nbsp;<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Sustainability continues to be a major area of opportunity for industry collaboration and standardization through communities such as the Open Compute Project. Working collaboratively as an ecosystem of hyperscalers and hardware partners is one catalyst to address the need for sustainable datacenter infrastructure that can effectively scale as compute demands continue to evolve. This year, we are pleased to continue our collaborations as part of OCP\u2019s Sustainability workgroup across areas such as carbon reporting, accounting, and circularity:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Announced at this year\u2019s Global Summit, we are partnering with AWS, Google, and Meta to fund the Product Category Rule initiative under the OCP Sustainability workgroup, with the goal of standardizing carbon measurement methodology for devices and datacenter equipment.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Together with Google, Meta, OCP, Schneider Electric, and the iMasons Climate Accord, we are establishing the <a href=\"https:\/\/www.opencompute.org\/blog\/open-compute-project-foundation-and-imasons-develop-taxonomy-for-carbon-disclosure\" target=\"_blank\" rel=\"noreferrer noopener\">Embodied Carbon Disclosure Base Specification<\/a> to establish a common framework for reporting the carbon impact of datacenter equipment.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Microsoft is advancing the adoption of waste heat reuse (WHR). In partnership with the NetZero Innovation Hub, NREL, and EU and US collaborators, Microsoft has published <a href=\"https:\/\/www.opencompute.org\/projects\/heat-reuse\" target=\"_blank\" rel=\"noreferrer noopener\">heat reuse reference designs<\/a> and is developing an economic modeling tool which provide data center operators and waste heat off takers\/consumers the cost it takes to develop the waste heat reuse infrastructure based on the conditions like the size and capacity of the WHR system, season, location, WHR mandates and subsidies in place. These region-specific solutions help operators convert excess heat into usable energy\u2014meeting regulatory requirements and unlocking new capacity, especially in regions like Europe where heat reuse is becoming mandatory.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">We have developed an open methodology for Life Cycle Assessment (LCA) at scale across large-scale IT hardware fleets to drive towards a \u201cgold standard\u201d in sustainable cloud infrastructure.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"rethinking-node-management-fleet-operational-resiliency-for-the-frontier-era\">Rethinking node management: Fleet operational resiliency for the frontier era<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As AI infrastructure scales at an unprecedented pace, Microsoft is investing in standardizing how diverse compute nodes are deployed, updated, monitored, and serviced across hyperscale datacenters. In collaboration with AMD, Arm, Google, Intel, Meta, and NVIDIA, we are driving a series of Open Compute Project (OCP) contributions focused on streamlining fleet operations, unifying firmware management, manageability interfaces and enhancing diagnostics, debug, and RAS (Reliability, Availability, and Serviceability) capabilities. This standardized approach to lifecycle management lays the foundation for consistent, scalable node operations during this period of rapid expansion. <a href=\"https:\/\/aka.ms\/ocp2025_lifecyclemanagement\" target=\"_blank\" rel=\"noreferrer noopener\">Read more about our approach to resilient fleet operations<\/a>.&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"paving-the-way-for-frontier-scale-ai-computing\">Paving the way for frontier-scale AI computing&nbsp;<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As we enter a new era of frontier-scale AI development, Microsoft takes pride in leading the advancement of standards that will drive the future of globally deployable AI supercomputing. Our commitment is reflected in our active role <a href=\"https:\/\/www.opencompute.org\/blog\/realizing-the-open-data-center-ecosystem-vision\" target=\"_blank\" rel=\"noreferrer noopener\">in shaping the ecosystem<\/a> that enables scalable, secure, and reliable AI infrastructure across the globe. We invite attendees of this year\u2019s OCP Global Summit to connect with Microsoft at booth #B53 to discover our latest cloud hardware demonstrations. These demonstrations showcase our ongoing collaborations with partners throughout the OCP community, highlighting innovations that support the evolution of AI and cloud technologies.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"connect-with-microsoft-at-the-ocp-global-summit-2025-and-beyond\">Connect with Microsoft at the OCP Global Summit 2025 and beyond<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Visit Microsoft at the OCP Global Summit at booth #B53.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Check out&nbsp;<a href=\"https:\/\/aka.ms\/OCP2025Sessions\" target=\"_blank\" rel=\"noreferrer noopener\">sessions delivered by Microsoft and partners<\/a>&nbsp;from OCP Summit 2025.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Take a&nbsp;<a href=\"https:\/\/aka.ms\/virtualdctour\" target=\"_blank\" rel=\"noreferrer noopener\">virtual tour<\/a>&nbsp;of Microsoft datacenters.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Learn more about Microsoft\u2019s&nbsp;<a href=\"https:\/\/datacenters.microsoft.com\/\">global infrastructure<\/a>.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Microsoft is contributing new standards across power, cooling, sustainability, security, networking, and fleet resiliency to advance innovation.<\/p>\n","protected":false},"author":76,"featured_media":47216,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":["aiblog-content-sync"],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1454],"tags":[2671,1819],"audience":[3055,3053,3056],"content-type":[1481],"product":[1803],"tech-community":[3004],"topic":[],"coauthors":[235,3126],"class_list":["post-47073","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-machine-learning","tag-ai","tag-datacenter","audience-developers","audience-it-decision-makers","audience-it-implementors","content-type-thought-leadership","product-azure-ai","review-flag-1680286581-295","review-flag-1-1680286581-825","review-flag-2-1680286581-601","review-flag-9-1680286581-259","review-flag-microsofts","review-flag-new-1680286579-546","review-flag-partn-1680286579-901","review-flag-partn-1680286579-300"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2025-10-14T00:35:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-15T20:37:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1920\" \/>\n\t<meta property=\"og:image:height\" content=\"1080\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Rani Borkar, Saurabh Dighe\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light-1.jpg\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Rani Borkar, Saurabh Dighe\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/rani-borkar\/\",\"@type\":\"Person\",\"@name\":\"Rani Borkar\"},{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/saurabh-dighe\/\",\"@type\":\"Person\",\"@name\":\"Saurabh Dighe\"}],\"headline\":\"Accelerating open-source infrastructure development for frontier AI at scale\",\"datePublished\":\"2025-10-14T00:35:00+00:00\",\"dateModified\":\"2025-10-15T20:37:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\"},\"wordCount\":1659,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg\",\"keywords\":[\"AI\",\"Datacenter\"],\"articleSection\":[\"AI + machine learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\",\"name\":\"Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg\",\"datePublished\":\"2025-10-14T00:35:00+00:00\",\"dateModified\":\"2025-10-15T20:37:37+00:00\",\"description\":\"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg\",\"width\":2560,\"height\":1440,\"caption\":\"ProEXR File Description =Attributes= channels (chlist) compression (compression): Zip16 dataWindow (box2i): [0, 0, 3499, 1968] displayWindow (box2i): [0, 0, 3499, 1968] lineOrder (lineOrder): Increasing Y pixelAspectRatio (float): 1 screenWindowCenter (v\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI + machine learning\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Accelerating open-source infrastructure development for frontier AI at scale\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/83fe4c04c61d5e58d555ba137c01a107\",\"name\":\"Garry Guseltsev\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g4f09d3e62b774b84289036a84f6a8c1c\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g\",\"caption\":\"Garry Guseltsev\"},\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/garryguseltsev\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog","description":"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/","og_locale":"en_US","og_type":"article","og_title":"Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog","og_description":"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2025-10-14T00:35:00+00:00","article_modified_time":"2025-10-15T20:37:37+00:00","og_image":[{"width":1920,"height":1080,"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light-1.jpg","type":"image\/jpeg"}],"author":"Rani Borkar, Saurabh Dighe","twitter_card":"summary_large_image","twitter_image":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light-1.jpg","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Rani Borkar, Saurabh Dighe","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/rani-borkar\/","@type":"Person","@name":"Rani Borkar"},{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/saurabh-dighe\/","@type":"Person","@name":"Saurabh Dighe"}],"headline":"Accelerating open-source infrastructure development for frontier AI at scale","datePublished":"2025-10-14T00:35:00+00:00","dateModified":"2025-10-15T20:37:37+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/"},"wordCount":1659,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg","keywords":["AI","Datacenter"],"articleSection":["AI + machine learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/","name":"Accelerating open-source infrastructure development for frontier AI at scale | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg","datePublished":"2025-10-14T00:35:00+00:00","dateModified":"2025-10-15T20:37:37+00:00","description":"Learn how Microsoft is addressing unprecedented power density and distribution challenges within hyperscale datacenters as AI workloads scale.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure-3D-Illustrations-CloudMigration-Light_final-scaled.jpg","width":2560,"height":1440,"caption":"ProEXR File Description =Attributes= channels (chlist) compression (compression): Zip16 dataWindow (box2i): [0, 0, 3499, 1968] displayWindow (box2i): [0, 0, 3499, 1968] lineOrder (lineOrder): Increasing Y pixelAspectRatio (float): 1 screenWindowCenter (v"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/accelerating-open-source-infrastructure-development-for-frontier-ai-at-scale\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"AI + machine learning","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/"},{"@type":"ListItem","position":3,"name":"Accelerating open-source infrastructure development for frontier AI at scale"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/83fe4c04c61d5e58d555ba137c01a107","name":"Garry Guseltsev","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g4f09d3e62b774b84289036a84f6a8c1c","url":"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/8476ebc2bcbe54e1843bd5cce3ec249bed771194411b3052815d4c5d272128f2?s=96&d=mm&r=g","caption":"Garry Guseltsev"},"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/garryguseltsev\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47073","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/76"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=47073"}],"version-history":[{"count":28,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47073\/revisions"}],"predecessor-version":[{"id":47353,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47073\/revisions\/47353"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media\/47216"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=47073"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=47073"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=47073"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=47073"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=47073"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=47073"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=47073"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=47073"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=47073"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}