Cross-Modal Attention Calibration for LVLM Hallucination Mitigation 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Cross-Modal Attention Calibration for LVLM Hallucination Mitigation arXiv:2501.01926v3 Announce Type: replace Abstract: Large vision-language models (LVLMs) have shown remarkable capabilities in visual-language understanding. Despite their success, LVLMs still suffer from generating hallucinations in complex generation tasks, leading to inconsistencies between visual inputs and generated content. To address this issue, some approaches have introduced inference-time interventions, such as contra

Cross-Modal Attention Calibration for LVLM Hallucination Mitigation · 相关技术