ReactBench: A Cause-Driven Benchmark for Multimodal Hallucination via Systematic Evaluation 事件

BREAKTHROUGH2026-05-29影响: HIGH

ReactBench: A Cause-Driven Benchmark for Multimodal Hallucination via Systematic Evaluation arXiv:2605.29579v1 Announce Type: new Abstract: While multimodal large language models (MLLMs) have achieved rapid progress in vision-language understanding, they remain prone to multimodal hallucinations, producing responses that are inconsistent with the visual input. Existing benchmarks predominantly focus on detecting hallucination outcomes rather than evaluating the underlying causes of these failur