Learnable Token Sparsification for Efficient Gigapixel Whole Slide Image Reasoning 事件

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

Learnable Token Sparsification for Efficient Gigapixel Whole Slide Image Reasoning arXiv:2606.08641v1 Announce Type: new Abstract: The processing of gigapixel whole slide images within vision language models faces a major difficulty due to an excessive number of visual tokens. Existing solutions typically rely on spatial downsampling or heuristic pruning strategies that operate without training, and these methods often discard subtle but clinically meaningful patterns because pathological evide