Utilizing Chiplet-Locality For Efficient Memory Mapping In MCM GPUs (ETRI, Sungkyunkwan Univ.)

Source: SemiEngineering

TL;DR

AI Generated

Researchers from ETRI and Sungkyunkwan University published a technical paper on leveraging chiplet-locality for efficient memory mapping in multi-chip module (MCM) GPUs. The study examines how the page size used in memory mapping affects the non-uniformity of the memory system in MCM GPUs. The authors introduce CLAP, a method that determines a suitable page size for each application based on its chiplet-locality patterns. CLAP optimizes data placement within chiplets, improving performance by up to 19.2% compared to previous paging schemes. The paper was presented at the 58th IEEE/ACM International Symposium on Microarchitecture.

Similar Articles

Intel has reportedly cancelled discrete gaming GPUs for the upcoming Xe3P Arc "Celestial" family — gaming GPU remains uncertain even for the next-gen Xe4 "Druid" lineup that lands in 2027

Intel has reportedly scrapped plans for discrete gaming GPUs in the upcoming Xe3P Arc "Celestial" family, leaving the fate of gaming GPUs uncertain even for the Xe4 "Druid" lineup expected in 2027. The Celestial GPU was originally intended for a 2025 launch but was replaced by Battlemage, with Xe3P now serving other purposes. Intel's focus seems to be shifting towards AI applications, with leaks suggesting a potential late-2027 release for the Druid architecture. The future of dedicated gaming GPUs from Intel remains speculative, with the possibility of a revival with the Druid lineup.

Tom's Hardware
SpaceX says it is going to begin manufacturing GPUs — $1.75 trillion IPO listing reportedly includes in-house GPU production

SpaceX's confidential $1.75 trillion IPO filing reveals plans to manufacture its own GPUs, investing billions in internal processor production due to a lack of long-term supply agreements with silicon suppliers. The company's intention to build GPUs, not specialized AI accelerators, is highlighted, with the naming convention still uncertain. While SpaceX's CEO confirmed plans for high-volume semiconductor manufacturing, the specifics of the GPUs remain unclear, raising questions about potential competition with existing AI GPU manufacturers like AMD and Nvidia. The S-1 form's confidential nature prevents verification of its content, leaving room for speculation on SpaceX's semiconductor endeavors.

Tom's Hardware
Chiplet Standards Aim For Plug-n-Play

Chiplet standards are crucial for creating a marketplace where chiplets can be easily interchanged like LEGOs. Various standards are being developed to ensure interoperability and physical composability of chiplets, including die-to-die interconnect standards like Bunch of Wires (BoW) and Universal Chiplet Interconnect Express (UCIe). These standards cover system architecture, security, power delivery, data semantics, physical placement, testing, and more. Organizations like the Open Compute Project (OCP) are leading efforts to standardize chiplet-related aspects, such as packaging descriptions and system architectures. The goal is to pave the way for a plug-and-play chiplet marketplace, although challenges related to practical and economic factors still exist.

SemiEngineering
Testing DirectStorage with GPU decompression — do Blackwell GPUs have the upper hand?

The article discusses the testing of DirectStorage with GPU decompression, focusing on whether Blackwell GPUs have an advantage in handling this technology. DirectStorage aims to optimize storage technology for faster asset streaming and reduced CPU overhead, with support for GPU decompression added in version 1.1. While Nvidia GPUs initially struggled with DirectStorage, Blackwell GPUs, like the 5090, showed improved performance with GPU decompression enabled. Tests on various Blackwell GPUs, including the 5070 and 5060, demonstrated consistent performance gains with DirectStorage. The article explores the potential reasons behind Blackwell GPUs handling GPU decompression more effectively, pointing to advancements in architecture and scheduling capabilities.

Tom's Hardware