REVEAL: Reference-Grounded Reasoning for Multimodal Manipulation Detection

Event: PRODUCT_LAUNCH | Date: 2026-05-28 | Impact: MEDIUM

arXiv:2605.28459v1 | Announce Type: new

Abstract: Multimodal manipulation detection aims to simultaneously identify forged image-text pairs and localize tampered regions, yet existing methods typically rely on memorizing isolated artifacts and struggle with imperceptible manipulation traces or domain shifts. Inspired by human comparative reasoning, we reformulate this task as a reference-grounded verification problem, whe