Back to home
Technology

Google deploys new Axion CPUs and seventh-gen Ironwood TPU — training and inferencing pods beat Nvidia GB300 and shape 'AI Hypercomputer' model

Source

Tom's Hardware

Published

TL;DR

AI Generated

Google Cloud has unveiled new AI-focused instances powered by its Axion CPUs and seventh-gen Ironwood TPUs, designed for training and low-latency inference of large-scale AI models. The Ironwood pods can scale up to 9,216 AI accelerators, surpassing Nvidia's capabilities with 42.5 FP8 ExaFLOPS for training and inference. These pods can be clustered into Google's AI Hypercomputer, integrating compute, storage, and networking for enhanced efficiency. Google also introduced its Axion CPUs, offering up to 50% greater performance and 60% higher energy efficiency compared to modern x86 CPUs, along with other configurations like C4A, N4A, and C4A Metal for various workloads.

Read Full Article

Similar Articles

Dr. L.C. Lu on TSMC Advanced Technology Design Solutions

Dr. L.C. Lu on TSMC Advanced Technology Design Solutions

Dr. L.C. Lu, a key figure at TSMC, focuses on design-technology co-optimization, packaging innovations, and AI-driven methodologies for next-gen semiconductor systems. TSMC emphasizes DTCO and DDCL innovations for scaling from N5 to A14 nodes, with NanoFlex and NanoFlex Pro architectures offering efficiency gains. N2P and N2U nodes incorporate advanced DTCO and power delivery optimizations, with hybrid dual-rail architectures achieving significant energy savings. TSMC collaborates with EDA partners for AI integration, enhancing productivity and design quality. Advanced packaging technologies like CoWoS and SoIC play a crucial role in enabling AI scaling, with memory bandwidth and interconnect performance scaling aggressively. TSMC addresses power delivery and thermal management challenges in AI systems through advanced solutions. TSMC's advancements in design methodologies and AI-driven automation promise improved productivity and scalability in chip-package co-design.

SemiWiki
MIT Technology Review

The Download: a new Christian phone network, and debugging LLMs

A new US phone network for Christians is launching, blocking porn and gender-related content with network-level controls. Goodfire, a San Francisco startup, released Silico, a tool for debugging AI models by allowing users to adjust parameters during training. The National Science Foundation faced mass firings, impacting US science funding and governance. China's AI labs are releasing open-source models, challenging the traditional Silicon Valley approach. Elon Musk admitted using OpenAI models for xAI training, sparking debate on AI ethics and practices.

MIT Technology Review
Mark Zuckerberg says Meta is cutting 8,000 jobs to pay for AI infrastructure — insatiable compute demand means the company can't rule out further headcount reductions

Mark Zuckerberg says Meta is cutting 8,000 jobs to pay for AI infrastructure — insatiable compute demand means the company can't rule out further headcount reductions

Meta CEO Mark Zuckerberg announced at a town hall that the company plans to cut 8,000 jobs due to the increasing costs of AI infrastructure. These layoffs will affect about 10% of Meta's workforce and are linked to the company's expanding AI budget. Zuckerberg mentioned that the focus on AI hardware is diverting funds from employee-related expenses, hinting at possible future reductions. Despite the layoffs, Meta reported strong Q1 earnings, raising questions about the necessity of the job cuts. The move highlights a broader debate about the role of AI in driving layoffs in the tech industry.

Tom's Hardware
Talent over tokens: AI models are becoming more expensive to run, and productivity gains are limited — efficient workers might be the solution to strained budgets

Talent over tokens: AI models are becoming more expensive to run, and productivity gains are limited — efficient workers might be the solution to strained budgets

As AI models become more expensive to run, with costs exceeding those of actual workers, companies are facing strained budgets. Despite the promise of productivity gains through AI deployment, many firms are not seeing the expected returns. High costs associated with AI usage are leading to budget exhaustion, with examples like Uber spending its annual AI budget in a few weeks. As AI spending continues to rise, companies may need to reconsider their reliance on AI and potentially invest in efficient human workers instead.

Tom's Hardware

We use cookies

We use cookies to ensure you get the best experience on our website. For more information on how we use cookies, please see our cookie policy.