How Much Progress Has There Been in NVIDIA Datacenter GPUs? 文章

ArXiv CS.AI2026-06-02NEWSen作者: Emanuele Del Sozzo, Martin Fleming, Kenneth Flamm, Neil Thompson

详细信息

来源站点: ArXiv CS.AI
作者: Emanuele Del Sozzo, Martin Fleming, Kenneth Flamm, Neil Thompson
文章类型: NEWS
语言: en
发布日期: 2026-06-02

摘要

arXiv:2601.20115v3 Announce Type: replace-cross Abstract: As the role of modern Graphics Processing Units (GPUs) becomes increasingly essential for several computing tasks, analyzing their past and current progress is paramount for determining future constraints on scientific research. This is particularly compelling in the Artificial Intelligence (AI) domain, where rapid technological advancements and fierce global competition have led the United States to recently implement export control regulations limiting international access to advanced AI chips. Consequently, this paper examines technical progress in NVIDIA datacenter GPUs from the mid-2000s through 2025. Our main results identify doubling times of 1.43 and 1.67 years for FP16 and FP32 dense operations, while FP64 doubling times range from 2.05 to 3.79 years. Off-chip memory size and bandwidth have grown at slower rates than computing performance, doubling every 3.29 to 3.

How Much Progress Has There Been in NVIDIA Datacenter GPUs? 文章

详细信息

摘要

相关事件

相关公司查看全部 (2)

相关人物

相关产品查看全部 (1)

相关技术查看全部 (6)