Microsoft deploys world's first 'supercomputer-scale' GB300 NVL72 Azure cluster — 4,608 GB300 GPUs linked together to form a single, unified accelerator capable of 92.1 exaFLOPS of FP4 inference
TL;DR
Microsoft has deployed the world's first large-scale GB300 NVL72 supercomputing cluster on its Azure cloud platform. Its 4,608 GB300 GPUs are connected by the NVLink 5 switch fabric within each rack and by Nvidia's Quantum-X800 InfiniBand fabric across racks. Each rack provides 130 TB/s of NVLink bandwidth, and each GPU has 800 Gb/s of InfiniBand bandwidth for traffic between racks. The cluster is dedicated to OpenAI workloads and is intended to speed up training and deployment of advanced reasoning models. The deployment is built on Nvidia's Blackwell Ultra GPUs, and Microsoft plans to roll out more such clusters worldwide.
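The headline figures can be sanity-checked with some quick arithmetic. The Python sketch below is illustrative only: it takes the GPU count, the FP4 throughput, and the per-GPU interconnect bandwidth from the article, and assumes that "NVL72" means 72 GPUs per rack. The names `ClusterBreakdown` and `cluster_breakdown` are hypothetical and not part of any Microsoft or Nvidia tooling.

```python
from dataclasses import dataclass

# Figures quoted in the article.
TOTAL_GPUS = 4_608
TOTAL_FP4_EXAFLOPS = 92.1
SCALE_OUT_GBPS_PER_GPU = 800  # interconnect bandwidth per GPU, in Gb/s

# Assumption: "NVL72" denotes 72 GPUs per rack.
GPUS_PER_RACK = 72


@dataclass
class ClusterBreakdown:
    racks: int
    fp4_petaflops_per_gpu: float
    fp4_petaflops_per_rack: float
    scale_out_tbps_per_rack: float  # terabits per second
    scale_out_tbps_total: float     # terabits per second


def cluster_breakdown(
    total_gpus: int = TOTAL_GPUS,
    total_exaflops: float = TOTAL_FP4_EXAFLOPS,
    gpus_per_rack: int = GPUS_PER_RACK,
    gbps_per_gpu: float = SCALE_OUT_GBPS_PER_GPU,
) -> ClusterBreakdown:
    """Derive per-rack and per-GPU figures from the cluster-wide totals."""
    racks, leftover = divmod(total_gpus, gpus_per_rack)
    if leftover:
        raise ValueError("GPU count is not a whole number of racks")

    total_petaflops = total_exaflops * 1_000  # 1 exaFLOPS = 1,000 petaFLOPS
    return ClusterBreakdown(
        racks=racks,
        fp4_petaflops_per_gpu=total_petaflops / total_gpus,
        fp4_petaflops_per_rack=total_petaflops / racks,
        scale_out_tbps_per_rack=gpus_per_rack * gbps_per_gpu / 1_000,
        scale_out_tbps_total=total_gpus * gbps_per_gpu / 1_000,
    )


if __name__ == "__main__":
    print(cluster_breakdown())
```

Under these assumptions the cluster works out to 64 racks, roughly 20 petaFLOPS of FP4 per GPU, and about 1.44 exaFLOPS of FP4 per rack. The bandwidth totals are simple sums of the per-GPU figure and say nothing about achievable throughput on a real workload.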