Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization · Event

PRODUCT_LAUNCH · 2026-06-01 · Impact: MEDIUM

arXiv:2605.31312v1 · Announce Type: new

Abstract: Multimodal hallucination remains a persistent challenge for Vision-Language Models (VLMs). Standard textual Direct Preference Optimization (DPO) often fails to mitigate it due to a lack of explicit visual supervision. While existing works introduce visual preference DPO by contrasting original images against negative

Related companies

ISC · COMPANY
arXiv · NONPROFIT
GLE · NONPROFIT
IREC · NONPROFIT
EARN · NONPROFIT
SHARE · NONPROFIT
ACT · NONPROFIT
VIA · COMPANY