{"id":47052,"date":"2025-10-09T09:00:00","date_gmt":"2025-10-09T16:00:00","guid":{"rendered":""},"modified":"2025-10-09T09:23:17","modified_gmt":"2025-10-09T16:23:17","slug":"microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/","title":{"rendered":"Microsoft Azure delivers the first large scale cluster with NVIDIA GB300 NVL72 for OpenAI workloads"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">Microsoft\u202fdelivers\u202fthe <strong>first at-scale production cluster with more than 4,600 NVIDIA GB300 NVL72, featuring NVIDIA Blackwell Ultra GPUs connected through the next-generation NVIDIA InfiniBand network<\/strong>. This cluster is the first of\u202fmany,\u202fas we scale\u202fto hundreds of thousands of Blackwell Ultra GPUs <a href=\"https:\/\/blogs.microsoft.com\/blog\/2025\/09\/18\/inside-the-worlds-most-powerful-ai-datacenter\/\" target=\"_blank\" rel=\"noreferrer noopener\">deployed across Microsoft\u2019s AI\u202fdatacenters<\/a> globally, reflecting our continued commitment to redefining AI infrastructure and collaboration with NVIDIA. The massive scale clusters with Blackwell Ultra GPUs will enable\u202fmodel training in weeks instead of months,\u202fdelivering high throughput for inference workloads. We are also unlocking bigger, more powerful models, and will be the first to support training models with hundreds of trillions of parameters.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This was made possible through collaboration across hardware, systems, supply chain, facilities, and multiple other disciplines, as well as with NVIDIA.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/azure.microsoft.com\/solutions\/high-performance-computing\/ai-infrastructure\/\" target=\"_blank\" rel=\"noreferrer noopener\">Power groundbreaking AI innovation with Azure AI Infrastructure<\/a><\/div>\n<\/div>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\">Microsoft Azure&#8217;s launch of the NVIDIA GB300 NVL72 supercluster is an exciting step in the advancement of frontier AI. This co-engineered system delivers the world&#8217;s first at-scale GB300 production cluster, providing the supercomputing engine needed for OpenAI to serve multitrillion-parameter models. This sets the definitive new standard for accelerated computing.<\/p>\n<cite>Ian Buck, Vice President of Hyperscale and High-performance Computing at NVIDIA<\/cite><\/blockquote>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"from-nvidia-gb200-to-gb300-a-new-standard-in-ai-performance\">From NVIDIA GB200 to GB300: A new standard in AI performance<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Earlier this year, <a href=\"https:\/\/techcommunity.microsoft.com\/blog\/azurehighperformancecomputingblog\/accelerating-the-intelligence-age-with-azure-ai-infrastructure-and-the-ga-of-nd-\/4394575\" target=\"_blank\" rel=\"noreferrer noopener\">Azure introduced ND GB200 v6 virtual machines (VMs)<\/a>, accelerated by NVIDIA&#8217;s Blackwell architecture. These quickly became the backbone of some of the most demanding AI workloads in the industry, including for organizations like OpenAI and Microsoft who already use massive clusters of GB200 NVL2 on Azure to train and deploy frontier models.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Now, with ND GB300 v6 VMs, Azure is raising the bar again. These VMs are optimized for reasoning models, agentic AI systems, and multimodal generative AI. Built on a rack-scale system, each rack has 18 VMs with a total of 72 GPUs:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">72 NVIDIA Blackwell Ultra GPUs (with 36 NVIDIA Grace CPUs).<\/li>\n\n\n\n<li class=\"wp-block-list-item\">800 gigabits per second (Gbp\/s) per GPU cross-rack scale-out bandwidth via next-generation NVIDIA Quantum-X800 InfiniBand (2x GB200 NVL72).<\/li>\n\n\n\n<li class=\"wp-block-list-item\">130 terabytes (TB) per second of NVIDIA NVLink bandwidth within rack.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">37TB of fast memory.<\/li>\n\n\n\n<li class=\"wp-block-list-item\">Up to 1,440 petaflops (PFLOPS) of FP4 Tensor Core performance.<\/li>\n<\/ul>\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/a-close-up-of-a-machine-ai-generated-content-may-3.webp\" alt=\"Close up of&nbsp;Azure server featuring NVIDIA GB300 NVL72, with Blackwell Ultra GPUs.\" class=\"wp-image-47067 webp-format\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/a-close-up-of-a-machine-ai-generated-content-may-3.webp\"><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"building-for-ai-supercomputing-at-scale\">Building for AI supercomputing at scale<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Building infrastructure for frontier AI requires us to reimagine every layer of the stack\u2014computing, memory, networking, datacenters, cooling, and power\u2014as a unified system. The ND GB300 v6 VMs are a clear representation of this transformation, from years of collaboration across silicon, systems, and software.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">At the rack level, NVLink and NVSwitch reduce memory and bandwidth constraints, enabling up to 130TB per second of intra-rack data-transfer connecting 37TB total of fast memory. Each rack becomes a tightly coupled unit, delivering higher inference throughput at reduced latencies on larger models and longer context windows, empowering agentic and multimodal AI systems to be more responsive and scalable than ever.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To scale beyond the rack, Azure deploys a full fat-tree, non-blocking architecture using NVIDIA Quantum-X800 Gbp\/s InfiniBand, the fastest networking fabric available today. This ensures that customers can scale up training of ultra-large models efficiently to tens of thousands of GPUs with minimal communication overhead, thus delivering better end-to-end training throughput. Reduced synchronization overhead also translates to maximum utilization of GPUs, which helps researchers iterate faster and at lower costs despite the compute-hungry nature of AI training workloads. Azure\u2019s co-engineered stack, including custom protocols, collective libraries, and in-network computing, ensures the network is highly reliable and fully utilized by the applications. Features like NVIDIA SHARP accelerate collective operations and double effective bandwidth by performing math in the switch, making large-scale training and inference more efficient and reliable.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Azure\u2019s advanced cooling systems use standalone heat exchanger units and facility cooling to minimize water usage while maintaining thermal stability for dense, high-performance clusters like GB300 NVL72. We also continue to develop and deploy new power distribution models capable of supporting the high energy density and dynamic load balancing required by the ND GB300 v6 VM class of GPU clusters.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Further, our reengineered software stacks for storage, orchestration, and scheduling are optimized to fully use computing, networking, storage, and datacenter infrastructure at supercomputing scale, delivering unprecedented levels of performance at high efficiency to our customers. <\/p>\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/a-machine-with-wires-and-wires-ai-generated-conte-2.webp\" alt=\"Server blade from a rack featuring NVIDIA GB300 NVL72 in Azure AI infrastructure.\" class=\"wp-image-47068 webp-format\" data-orig-src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/a-machine-with-wires-and-wires-ai-generated-conte-2.webp\"><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"looking-ahead\">Looking ahead<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Microsoft has invested in AI infrastructure for years, to allow for fast enablement and transition into the newest technology. It is also why <a href=\"https:\/\/azure.microsoft.com\/en-us\/solutions\/high-performance-computing\/ai-infrastructure\/\">Azure<\/a> is uniquely positioned to deliver GB300 NVL72 infrastructure at production scale at a rapid pace, to meet the demands of frontier AI today.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As Azure continues to ramp up GB300 worldwide deployments, customers can expect to train and deploy new models in a fraction of the time compared to previous generations. The ND GB300 v6 VMs v6 are poised to become the new standard for AI infrastructure, and Azure is proud to lead the way, supporting customers to advance frontier AI development.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Stay tuned for more updates and performance benchmarks as Azure expands production deployment of NVIDIA GB300 NVL72 globally.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/blogs.nvidia.com\/blog\/microsoft-azure-worlds-first-gb300-nvl72-supercomputing-cluster-openai\/\" target=\"_blank\" rel=\"noreferrer noopener\">Read more from NVIDIA here.<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Microsoft\u202fdelivers\u202fthe first at-scale production cluster with more than 4,600 NVIDIA GB300 NVL72, featuring NVIDIA Blackwell Ultra GPUs connected through the next-generation NVIDIA InfiniBand network.<\/p>\n","protected":false},"author":41,"featured_media":47094,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":["aiblog-content-sync"],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1467],"tags":[2671,1819],"audience":[3054,3055,3053,3056],"content-type":[1465,1497],"product":[1455],"tech-community":[2977],"topic":[],"coauthors":[235,2701],"class_list":["post-47052","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-compute","tag-ai","tag-datacenter","audience-business-decision-makers","audience-developers","audience-it-decision-makers","audience-it-implementors","content-type-announcements","content-type-partnerships","product-virtual-machines","review-flag-1680286581-295","review-flag-1-1680286581-825","review-flag-4-1680286581-250","review-flag-microsofts","review-flag-new-1680286579-546","review-flag-vm-1680286585-143"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2025-10-09T16:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-09T16:23:17+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/NVIDIA-GB300-NVL72-for-OpenAI-workloads_Social.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Rani Borkar, Nidhi Chappell\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/NVIDIA-GB300-NVL72-for-OpenAI-workloads_Social.png\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Rani Borkar, Nidhi Chappell\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/rani-borkar\/\",\"@type\":\"Person\",\"@name\":\"Rani Borkar\"},{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/nidhi-chappell\/\",\"@type\":\"Person\",\"@name\":\"Nidhi Chappell\"}],\"headline\":\"Microsoft Azure delivers the first large scale cluster with NVIDIA GB300 NVL72 for OpenAI workloads\",\"datePublished\":\"2025-10-09T16:00:00+00:00\",\"dateModified\":\"2025-10-09T16:23:17+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\"},\"wordCount\":841,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp\",\"keywords\":[\"AI\",\"Datacenter\"],\"articleSection\":[\"Compute\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\",\"name\":\"NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp\",\"datePublished\":\"2025-10-09T16:00:00+00:00\",\"dateModified\":\"2025-10-09T16:23:17+00:00\",\"description\":\"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp\",\"width\":1260,\"height\":708,\"caption\":\"Server blade from a rack featuring NVIDIA GB300 NVL72 in Azure AI infrastructure.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Compute\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/compute\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Microsoft Azure delivers the first large scale cluster with NVIDIA GB300 NVL72 for OpenAI workloads\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/2bc315aef2026a1248b3baf2debe42e9\",\"name\":\"Veronica Sun\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=gadeaa9c0cc5940196f11beb666a2c5db\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=g\",\"caption\":\"Veronica Sun\"},\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/veronicasun\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog","description":"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/","og_locale":"en_US","og_type":"article","og_title":"NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog","og_description":"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2025-10-09T16:00:00+00:00","article_modified_time":"2025-10-09T16:23:17+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/NVIDIA-GB300-NVL72-for-OpenAI-workloads_Social.png","type":"image\/png"}],"author":"Rani Borkar, Nidhi Chappell","twitter_card":"summary_large_image","twitter_image":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/NVIDIA-GB300-NVL72-for-OpenAI-workloads_Social.png","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Rani Borkar, Nidhi Chappell","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/rani-borkar\/","@type":"Person","@name":"Rani Borkar"},{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/nidhi-chappell\/","@type":"Person","@name":"Nidhi Chappell"}],"headline":"Microsoft Azure delivers the first large scale cluster with NVIDIA GB300 NVL72 for OpenAI workloads","datePublished":"2025-10-09T16:00:00+00:00","dateModified":"2025-10-09T16:23:17+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/"},"wordCount":841,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp","keywords":["AI","Datacenter"],"articleSection":["Compute"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/","name":"NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp","datePublished":"2025-10-09T16:00:00+00:00","dateModified":"2025-10-09T16:23:17+00:00","description":"Explore how the NVIDIA GB300 NVL72 delivers high-performance AI infrastructure, global availability, and faster model training on Azure.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/10\/Azure_1086471_blog_251008.webp","width":1260,"height":708,"caption":"Server blade from a rack featuring NVIDIA GB300 NVL72 in Azure AI infrastructure."},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"Compute","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/compute\/"},{"@type":"ListItem","position":3,"name":"Microsoft Azure delivers the first large scale cluster with NVIDIA GB300 NVL72 for OpenAI workloads"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/2bc315aef2026a1248b3baf2debe42e9","name":"Veronica Sun","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=gadeaa9c0cc5940196f11beb666a2c5db","url":"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/935bdb80b0630ee71e6b746e2a593e56487b79519091f38921478235bd3ff54d?s=96&d=mm&r=g","caption":"Veronica Sun"},"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/veronicasun\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47052","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=47052"}],"version-history":[{"count":50,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47052\/revisions"}],"predecessor-version":[{"id":47166,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/47052\/revisions\/47166"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media\/47094"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=47052"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=47052"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=47052"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=47052"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=47052"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=47052"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=47052"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=47052"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=47052"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}