Rethinking Visual Neglect: Steering via Context-Preference for MLLM Hallucination Mitigation 文章

ArXiv CS.CL2026-05-28NEWSen作者: Jingwen Wu, Xijun Zhang, Ge Song

摘要

arXiv:2605.27993v1 Announce Type: new Abstract: Object hallucination remains a primary obstacle to the reliable deployment of Multimodal Large Language Models (MLLMs). Current inference-time mitigation methods mainly assume hallucinations stem from visual neglect, steering models to enhance visual reliance. In contrast, our systematic interventions on multiple MLLMs show that pushing toward more visual reliance may exacerbate hallucinations on some models, while less may mitigate hallucinations. This result suggests that attributing hallucinations solely to visual insufficiency is underdetermined. We argue that the image, as a context, simultaneously competes with the model's parametric knowledge and the textual context. For this, we propose a training-free framework, Context-Preference Activation Steering (CAS).

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据