Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin 文章

ArXiv CS.AI2026-06-09NEWSen作者: Hanyang Li, Jianhao Ma, Ying Cui

详细信息

来源站点: ArXiv CS.AI
作者: Hanyang Li, Jianhao Ma, Ying Cui
文章类型: NEWS
语言: en
发布日期: 2026-06-09

摘要

arXiv:2606.09012v1 Announce Type: cross Abstract: Post-training quantization (PTQ) converts a trained full-precision model into low-bit weights without task-level retraining, while quantization-aware training (QAT) incorporates quantization into the training loop. Although PTQ is efficient and often accurate at moderate bitwidths, it can fail sharply at aggressive bitwidths; QAT is more expensive but can often recover the lost accuracy. We propose a unified geometric framework that explains both PTQ failure and QAT recovery. We model full-precision training as following a low-loss \emph{river} inside a wider \emph{valley}: a normal neighborhood of the river forms a nearly flat \emph{basin}, while leaving this basin incurs a sharp loss increase.

Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (5)