ETC: Extreme Token Compression via Task-aware Visual Information Distillation in VLMs
Event

PRODUCT_LAUNCH 2026-06-02 Impact: MEDIUM

arXiv:2606.00543v1 Announce Type: new
Abstract: In Vision-Language Models (VLMs), high-resolution images produce a large number of visual tokens, resulting in high computational costs and KV-cache overhead during inference. To address this problem, we propose an Extreme Token Compression (ETC) framework that minimizes task loss when reducing the number of input tokens based on the principle of variational info
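
To give a sense of scale for the KV-cache overhead the abstract mentions, here is a rough back-of-the-envelope sketch. It is not taken from the paper: the ViT-style patching, the image size, the patch size, the model dimensions, and the 64-token compressed budget are all hypothetical values chosen for illustration, and the function names are made up for this example.

```python
def visual_token_count(height: int, width: int, patch_size: int) -> int:
    """Tokens from non-overlapping square patches (ViT-style patching is an assumption)."""
    return (height // patch_size) * (width // patch_size)


def kv_cache_bytes(num_tokens: int, num_layers: int, num_kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """KV-cache size: one key and one value vector per token, per layer, per KV head."""
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_value
    return per_token * num_tokens


# Hypothetical setup: 1344x1344 image, 14-pixel patches,
# 32 layers, 32 KV heads of dimension 128, fp16 cache (2 bytes per value).
tokens = visual_token_count(1344, 1344, 14)
full = kv_cache_bytes(tokens, num_layers=32, num_kv_heads=32, head_dim=128)

# Hypothetical compressed budget of 64 visual tokens (not a figure from the paper).
compressed = kv_cache_bytes(64, num_layers=32, num_kv_heads=32, head_dim=128)

print(f"visual tokens:       {tokens}")
print(f"KV cache, full:      {full / 2**20:.0f} MiB")
print(f"KV cache, 64 tokens: {compressed / 2**20:.0f} MiB")
```

Under these assumed numbers the visual tokens alone would occupy several GiB of cache, which is why cutting the token count before the language model sees them is attractive. The cache size scales linearly with the token count, so any reduction ratio carries over directly.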