Self-Prophetic Decoding to Unlock Visual Search in LVLMs

Event | PRODUCT_LAUNCH | 2026-05-28 | Impact: MEDIUM

arXiv:2605.28741v1
Announce Type: new

Abstract: Large Vision-Language Models (LVLMs) are rapidly evolving toward true multimodal reasoning, with visual search representing a concrete instantiation of the thinking-with-images paradigm. However, LVLM visual search faces two key challenges: incompatibility among intrinsic capabilities after post-training, and interference in long multi-step reasoning contexts. To address these, we identify t