Reasoning Matters: Mitigate Hallucination in Multimodal Large Reasoning Models via Reasoning-Conditioned Preference Optimization · Event

PRODUCT_LAUNCH · 2026-05-28 · Impact: MEDIUM

arXiv:2605.27906v1 · Announce Type: new

Abstract: Multimodal Large Reasoning Models introduce the reasoning paradigm, demonstrating strong capabilities on complex vision-language tasks. However, they still suffer from severe hallucinations. Existing training-based methods typically mitigate hallucinations through response-level direct preference optimization (DPO), wher

Related companies

arXiv · NONPROFIT
IREC · NONPROFIT
EARN · NONPROFIT
EAT · NONPROFIT
ACT · NONPROFIT
Ratio · RESEARCH_INSTITUTE
chain · COMPANY
VIA · COMPANY