close
close

Nvidia wants to increase AI capabilities with an energy-efficient R100 chip in 2025

Nvidia, a leader in the graphics processing unit (GPU) industry, is gearing up to release its next-generation AI chip, the R100, which is expected to begin mass production in the last quarter of 2025. According to industry analyst Guo Mingyi, the highly anticipated chip will deliver significant advances in AI computing power while addressing critical energy consumption issues encountered in AI server operations and data center construction.

The R100 is expected to utilize Taiwan Semiconductor Manufacturing Company’s (TSMC) N3 process technology, which includes the advanced CoWoS-L packaging method, a technique also found in Nvidia’s recently launched B100 Blackwell chip. However, the R100 will distinguish itself with a larger interposer and a reticle design that is approximately four times larger than its predecessor, suggesting substantial improvements in both scale and performance.

Each R100 chip will consist of four smaller reticles, creating a unified, larger processor. This design also includes eight units of the latest HBM4 memory, making the R100 a formidable player in the high-performance computing (HPC) and artificial intelligence sectors. Additionally, Nvidia plans to pair the R100 with its GR200 Grace CPU, which will also be produced using TSMC’s N3 process technology, an upgrade from the N5 technology currently used in the GH200 and GB200 Grace superchips.

The shift toward improved energy efficiency is a strategic move by Nvidia as it responds to growing concerns about the operational costs associated with AI technologies. Previous generations of Nvidia’s AI chips, such as the H100, required significant power, training a 1.8 trillion parameter model that consumed up to 15 megawatts of electricity. In contrast, Nvidia’s Blackwell GPUs have dramatically reduced that consumption to just 4 megawatts, illustrating a significant reduction in energy needs.

This focus on energy efficiency does not come at the expense of processing power. The R100 is poised to continue Nvidia’s legacy of rapidly advancing AI capabilities, with the B100 marking a thousandfold increase in computing power over an eight-year period. The R100 aims to push these boundaries even further and meet the intensive needs of modern AI applications, from autonomous vehicles to complex data analysis.

© 2024 Business Times All rights reserved. Do not reproduce without permission.