Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation 文章

ArXiv CS.CV2026-06-01NEWSen作者: Zheng Qi, Chao Shang, Evangelia Spiliopoulou, Nikolaos Pappas

Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation · 相关技术