Nvidia details efficiency of the NVFP4 format for LLM training — new paper reveals how NVFP4 offers benefits over FP8 and BF16
TL;DR
Nvidia's NVFP4 format, designed for Blackwell GPUs, offers efficiency benefits for both training and inference. The format combines a compact 4-bit data representation with a multi-level scaling strategy, achieving accuracy close to BF16 while reducing memory usage and compute cost. Nvidia trained a 12-billion-parameter model on a 10-trillion-token dataset using NVFP4, closely matching the FP8 baseline results. Techniques such as mixed precision, consistent scaling, stochastic rounding, and outlier handling proved crucial for stable training at 4-bit precision. NVFP4 also outperformed the MXFP4 format in convergence and data efficiency, showing promise for efficiently training large-scale language models.
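To make the scaling and rounding ideas concrete, here is a minimal numpy sketch of block-scaled 4-bit quantization in the style NVFP4 uses: values are grouped into small micro-blocks, each block gets its own scale factor, and magnitudes are snapped to the 4-bit E2M1 grid, optionally with stochastic rounding. This is an illustrative simulation, not Nvidia's implementation; the function name, block handling, and the choice of a single float scale per block (NVFP4 stores block scales in FP8/E4M3 plus a tensor-level scale) are simplifying assumptions.

```python
import numpy as np

# Magnitudes representable by a 4-bit E2M1 value (sign handled separately)
E2M1_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def quantize_nvfp4_block(x, stochastic=False, rng=None):
    """Simulate NVFP4-style quantization of one micro-block.

    Illustrative sketch: the block shares one scale, chosen so the
    block's absolute maximum maps to the largest E2M1 magnitude (6).
    Real NVFP4 stores this scale in FP8 (E4M3) with an additional
    per-tensor scale; here we keep it as a plain float for clarity.
    """
    x = np.asarray(x, dtype=np.float64)
    amax = np.max(np.abs(x))
    if amax == 0.0:
        return np.zeros_like(x)
    scale = amax / 6.0          # per-block scale factor
    mags = np.abs(x) / scale    # magnitudes now lie in [0, 6]
    signs = np.sign(x)

    # Find the two neighboring grid points around each magnitude
    hi_idx = np.clip(np.searchsorted(E2M1_GRID, mags, side="left"),
                     1, len(E2M1_GRID) - 1)
    lo, hi = E2M1_GRID[hi_idx - 1], E2M1_GRID[hi_idx]

    if stochastic:
        # Stochastic rounding: round up with probability proportional to
        # the distance past the lower grid point (unbiased in expectation,
        # which is why it helps low-precision gradient accumulation)
        rng = rng or np.random.default_rng(0)
        p_up = (mags - lo) / (hi - lo)
        q = np.where(rng.random(mags.shape) < p_up, hi, lo)
    else:
        # Deterministic round-to-nearest grid point
        q = np.where(mags - lo < hi - mags, lo, hi)

    return signs * q * scale    # dequantized back to real scale
```

Values that already sit on the grid (after scaling) round to themselves, while in-between values incur a quantization error bounded by half the local grid step times the block scale; stochastic rounding trades a larger per-element error for zero bias on average.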