Microsoft Azure Unveils Nvidia GB300 NVL72 Cluster Built for OpenAI’s AI Workloads

Microsoft said it has delivered more than 4,600 Nvidia GB300 NVL72, featuring its Blackwell Ultra GPUs.

Advertisement
Written by Akash Dutta, Edited by Rohan Pal | Updated: 10 October 2025 15:46 IST
Highlights
  • The cluster is connected through the Nvidia InfiniBand network
  • Microsoft has procured a total of 72 Nvidia Blackwell Ultra GPUs
  • The entire architecture is built on a rack-scale system

Microsoft said it has delivered more than 4,600 Nvidia GB300 NVL72, featuring its Blackwell Ultra GPUs.

Photo Credit: Microsoft

Microsoft Azure unveiled the new NDv6 GB300 virtual machine (VM) series on Thursday. Claimed to be the industry's first supercomputing-scale production cluster of the Nvidia GB300 NVL72 systems, it will be made available for OpenAI's “most demanding artificial intelligence (AI) inference workloads.” The Redmond-based tech giant says these VMs are optimised for reasoning models, agentic AI systems, and multimodal generative AI workflows. Interestingly, with the new architecture, Azure has upgraded from ND GB200 v6 VMs, which were introduced less than a year ago.

Microsoft Azure Upgrades Cloud Computing Stack With New Nvidia Hardware

In a blog post, Microsoft's cloud division, Azure, announced the creation of its new virtual machines. The cluster is powered by more than 4,600 Nvidia GB300 NVL72 systems, which feature the company's Blackwell Ultra GPUs connected via its InfiniBand network. Microsoft claims that the cluster will enable model training in weeks instead of months and deliver high throughput for inference workloads. It is said to support training models with “hundreds of trillions of parameters.”

Breaking down the system, the cloud division has implemented a rack-scale architecture, where each rack contains 18 virtual machines with a total of 72 GPUs and 36 Nvidia Grace CPUs. Each GPU can communicate at 800GBps per GPU via Nvidia's Quantum-X800 InfiniBand, which uses two GB200 NVL72 systems.

Advertisement

Inside each rack, the chips are connected with ultra-fast links that can move 130TB of data per second. There's a huge 37TB of very fast memory to handle massive calculations. Overall, it can perform up to 1,440 petaflops (PFLOPS) of AI calculations per second using FP4 Tensor Cores, making it one of the fastest systems in the world for AI tasks.

Advertisement

Within each rack, NVLink and NVSwitch, special high-speed connections that let GPUs talk to each other extremely quickly, allow 37TB of memory to exchange data at up to 130TB per second. This tight integration means AI models can process larger tasks faster, handle longer sequences of information, and run complex agentic (AI that can make decisions on its own) or multimodal (AI that can process multiple types of data like text, images, and audio together) workloads with minimal delays.

Microsoft says to expand beyond a single rack, Azure uses a full fat-tree, non-blocking network, a networking design that ensures all racks can communicate without slowdowns, powered by InfiniBand. This allows AI training to scale efficiently across tens of thousands of GPUs while keeping communication delays minimal. By reducing synchronisation overhead (the time GPUs spend waiting for each other), GPUs spend more time computing, helping researchers train massive AI models faster and at lower cost.

Advertisement

Azure's co-designed stack combines custom protocols, collective libraries, and in-network computing to ensure the network is reliable and fully utilised. Additionally, Microsoft's cooling systems use standalone heat exchanger units along with facility cooling to reduce water use. On the software side, the company says it has reengineered stacks for storage, orchestration, and scheduling.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. OnePlus 15R Will Launch in India on This Date Alongside Pad Go 2
  2. Realme C85 5G Will Launch in India on This Date
  3. Huawei Watch GT 6, Watch GT 6 Pro Launched in India At This Price
  4. iQOO 15: Everything You Need to Know Ahead of Launch in India
  5. Honor 500 Pro, Honor 500 Launched With 8,000mAh Battery: See Price
  6. Moto G57 Power With 50-Megapixel Sony LYT-600 Camera Launched in India
  7. Black Friday Sale: Check Discounts on These iPhone 16 Models on Vijay Sales
  8. Red Magic 11 Air Listed on Chinese Regulator's Website With These Features
  9. Vivo S50 Series Confirmed to Launch Soon With This Sony Sensor
  10. NASA's Perseverance Rover Finds Metal-Rich Rock on Mars: What You Need to Know
  1. NASA’s Perseverance Rover Finds Metal-Rich Rock on Mars: What You Need to Know
  2. ISS Experiment Shows Moss Spores Can Survive Harsh Space Environment
  3. Asteroid 2024 YR4: Earth Safe, but New Data Shows Small 2032 Lunar Impact Risk
  4. Stephen OTT Release Date: When and Where to Watch it Online?
  5. Kuttram Purindhavam OTT Release Date: When and Where to Watch it Online?
  6. Sreejith Lal’s Malayalam Film Inland Now Streaming on ManoramaMAX
  7. The Great Pre-Wedding Show OTT Release Date: Know Where to Watch This Telugu Comedy-Drama Online
  8. Nadu Center Season 1 Now Streaming on JioHotstar: Everything You Need to Know About this Inspiring Tamil Sports Drama
  9. Aaryan OTT Release: Know Everything About Streaming, Plot, Cast, and More
  10. Sasivadane OTT Release Date: When and Where to Watch This Telugu Romantic Drama Online?
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.