Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering 文章

ArXiv CS.CV2026-05-29NEWSen作者: Soumyadeep Jana, Pulkit Mittal, Sanasam Ranbir Singh

摘要

arXiv:2605.29881v1 Announce Type: new Abstract: Large vision-language models (LVLMs) often hallucinate objects that are not present in the input image, largely because visual grounding weakens as decoding progresses. Existing inference-time mitigation methods modify logits or hidden states throughout generation, but they suffer from three key limitations: they lack an explicit grounding objective, intervene even when the model is already well-grounded, and use fixed correction strengths that do not adapt to the severity of grounding failure. We propose BRACS (Barrier-Regulated Adaptive Closed-form Steering), a training-free steering framework that addresses these issues through barrier-regulated adaptive closed-form steering. BRACS monitors the model's own attention to measure visual grounding and applies corrections to the hidden states only when grounding deteriorates.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据