Microsoft deploys world's first 'supercomputer-scale' GB300 NVL72 Azure cluster — 4,608 GB300 GPUs linked together to form a single, unified accelerator capable of 92.1 exaFLOPS of FP4 inference

Source

Tom's Hardware

TL;DR

AI Generated

Microsoft has deployed what it describes as the world's first large-scale GB300 NVL72 supercomputing cluster on its Azure cloud platform, linking 4,608 GB300 GPUs through an NVLink 5 switch fabric inside each rack and Nvidia’s Quantum-X800 InfiniBand fabric between racks. Each rack provides 130 TB/s of NVLink bandwidth, and each GPU gets 800 Gb/s of scale-out InfiniBand bandwidth. The cluster is dedicated to OpenAI workloads and is intended to speed up training and deployment of advanced reasoning models. It is built on Nvidia's Blackwell Ultra GPUs, and Microsoft plans to deploy more such clusters worldwide.
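
The headline figures can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it takes the GPU count, the FP4 total, and the per-GPU interconnect figure from the text above, and it assumes that "NVL72" means 72 GPUs per rack. The function name `cluster_breakdown` and the dictionary keys are made up for this example.

```python
# Back-of-the-envelope breakdown of the headline numbers.
# Assumption (not stated in the summary): "NVL72" means 72 GPUs per rack.

GPUS_TOTAL = 4_608           # from the headline
GPUS_PER_RACK = 72           # assumed from the "NVL72" name
FP4_TOTAL_EXAFLOPS = 92.1    # from the headline
LINK_GBPS_PER_GPU = 800      # per-GPU interconnect figure from the summary


def cluster_breakdown(gpus_total, gpus_per_rack, fp4_exaflops, gbps_per_gpu):
    """Derive per-rack and per-GPU figures from cluster-wide totals."""
    if gpus_total % gpus_per_rack != 0:
        raise ValueError("GPU count is not a whole number of racks")
    racks = gpus_total // gpus_per_rack
    fp4_petaflops = fp4_exaflops * 1_000  # 1 exaFLOPS = 1,000 petaFLOPS
    return {
        "racks": racks,
        "fp4_pflops_per_gpu": fp4_petaflops / gpus_total,
        "fp4_pflops_per_rack": fp4_petaflops / racks,
        # Aggregate per-GPU link bandwidth for one rack, in terabits per second.
        "link_tbps_per_rack": gbps_per_gpu * gpus_per_rack / 1_000,
    }


result = cluster_breakdown(
    GPUS_TOTAL, GPUS_PER_RACK, FP4_TOTAL_EXAFLOPS, LINK_GBPS_PER_GPU
)
for name, value in result.items():
    print(f"{name}: {value:,.2f}")
```

Under that assumption the cluster works out to 64 racks, roughly 20 petaFLOPS of FP4 per GPU, and about 1,440 petaFLOPS per rack. Treat these as rounded estimates rather than vendor specifications.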

Similar Articles

China announces CPU-only exascale supercomputer with 47,000 homemade processors, record 2 Exaflops of performance without GPUs — Lingshen super said to use Huawei Kunpeng servers and no foreign-made components

China's National Supercomputing Center in Shenzhen unveiled the Lingshen supercomputer project, aiming for over 2 ExaFLOPS performance using 47,000 homemade processors without GPUs or foreign components. The system, designed to surpass the current fastest supercomputer, El Capitan, would utilize Huawei Kunpeng servers and Arm-based Taishan cores. The project includes a pilot phase with 100 servers and a full production system with 1,580 blade servers. While China's claims of achieving 2+ ExaFLOPS are ambitious, questions remain about the feasibility of surpassing existing supercomputing benchmarks without GPUs or foreign-made CPUs.

Tom's Hardware
Intel has reportedly cancelled discrete gaming GPUs for the upcoming Xe3P Arc "Celestial" family — gaming GPU remains uncertain even for the next-gen Xe4 "Druid" lineup that lands in 2027

Intel has reportedly scrapped plans for discrete gaming GPUs in the upcoming Xe3P Arc "Celestial" family, and the fate of gaming GPUs is uncertain even for the Xe4 "Druid" lineup expected in 2027. Celestial was originally intended for a 2025 launch but was replaced by Battlemage, with Xe3P now serving other purposes. Intel's focus appears to be shifting toward AI applications, and leaks suggest a late-2027 release for the Druid architecture. Whether Intel will ship dedicated gaming GPUs again remains speculative, though the Druid lineup could revive them.

Tom's Hardware
SpaceX says it is going to begin manufacturing GPUs — $1.75 trillion IPO listing reportedly includes in-house GPU production

SpaceX's confidential $1.75 trillion IPO filing reportedly reveals plans to manufacture its own GPUs, with billions to be invested in internal processor production because the company lacks long-term supply agreements with silicon suppliers. The filing specifies GPUs rather than specialized AI accelerators, although the naming convention is still uncertain. SpaceX's CEO has confirmed plans for high-volume semiconductor manufacturing, but the specifics of the GPUs remain unclear, which raises questions about potential competition with existing AI GPU makers such as AMD and Nvidia. Because the S-1 was filed confidentially, its contents cannot be independently verified.

Tom's Hardware
Testing DirectStorage with GPU decompression — do Blackwell GPUs have the upper hand?

The article tests DirectStorage with GPU decompression to see whether Blackwell GPUs have an advantage. DirectStorage is designed to speed up asset streaming and reduce CPU overhead, and version 1.1 added support for GPU decompression. Nvidia GPUs initially struggled with DirectStorage, but Blackwell GPUs such as the 5090 showed improved performance with GPU decompression enabled. Tests on other Blackwell GPUs, including the 5070 and 5060, showed consistent gains. The article attributes the improvement to possible advances in architecture and scheduling.

Tom's Hardware
