EarlyTom: Early Token Compression Completes Fast Video Understanding
Event: PRODUCT_LAUNCH, 2026-05-29, Impact: MEDIUM
arXiv:2605.30010v1. Announce Type: new.

Abstract: Video large language models (Video-LLMs) have demonstrated strong capabilities in video understanding tasks. However, their practical deployment is still hindered by the inefficiency introduced by processing massive amounts of visual tokens. Although recent approaches achieve extremely low token retention ratios while maintaining accuracy comparable to full-token baselines, most
Source: arXiv cs.CV, 2026-05-29