We use cookies

We use cookies to ensure you get the best experience on our website. For more information on how we use cookies, please see our cookie policy.

Back to home

Microsoft deploys world's first 'supercomputer-scale' GB300 NVL72 Azure cluster — 4,608 GB300 GPUs linked together to form a single, unified accelerator capable of 92.1 exaFLOPS of FP4 inference

Source

Tom's Hardware

Published

TL;DR

AI Generated

Microsoft has introduced the world's first large-scale GB300 NVL72 supercomputing cluster on its Azure cloud platform, featuring 4,608 GB300 GPUs connected by NVLink 5 switch fabric and Nvidia’s Quantum-X800 InfiniBand networking fabric. Each rack in the cluster boasts a memory bandwidth of 130 TB/s and 800 Gb/s of interconnect bandwidth per GPU. The cluster is dedicated to OpenAI workloads, enabling faster model training and deployment for advanced reasoning models. Nvidia's Blackwell Ultra GPUs and Microsoft's Azure platform are at the forefront of this deployment, with plans for more clusters worldwide.