Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers 文章

ArXiv CS.CV2026-06-08NEWSen作者: Biao Qian, Yang Wang, Yong Wu, Jungong Han

详细信息

来源站点
ArXiv CS.CV
作者
Biao Qian, Yang Wang, Yong Wu, Jungong Han
文章类型
NEWS
语言
en
发布日期
2026-06-08

摘要

arXiv:2606.04373v2 Announce Type: replace Abstract: Data-Free Quantization (DFQ) addresses data security concerns by synthesizing samples, without accessing real data. It has garnered increasing attention in the context of Vision Transformers (ViTs), owing to the superiority of the self-attention mechanism compared to classical convolutional operation. However, previous DFQ arts for ViTs often suffer from a distribution mismatch between synthetic samples and input distribution expected by quantized models Q, resulting in the suboptimal performance. In this paper, we propose a novel Masked Attention Alignment approach for Data-Free Quantization of ViTs, named MaskAQ, revealing that: 1) the semantics in the self-attention mechanism is predominantly localized to a sparse subset of patches, called informative regions; 2) the informative regions dominate the mutual information between synthetic samples and Q's outputs.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据