Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin 文章

ArXiv CS.AI2026-06-09NEWSen作者: Hanyang Li, Jianhao Ma, Ying Cui

详细信息

来源站点
ArXiv CS.AI
作者
Hanyang Li, Jianhao Ma, Ying Cui
文章类型
NEWS
语言
en
发布日期
2026-06-09

摘要

arXiv:2606.09012v1 Announce Type: cross Abstract: Post-training quantization (PTQ) converts a trained full-precision model into low-bit weights without task-level retraining, while quantization-aware training (QAT) incorporates quantization into the training loop. Although PTQ is efficient and often accurate at moderate bitwidths, it can fail sharply at aggressive bitwidths; QAT is more expensive but can often recover the lost accuracy. We propose a unified geometric framework that explains both PTQ failure and QAT recovery. We model full-precision training as following a low-loss \emph{river} inside a wider \emph{valley}: a normal neighborhood of the river forms a nearly flat \emph{basin}, while leaving this basin incurs a sharp loss increase.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据