Ask4VG: Risk-Aware Question Selection for Reducing Prior-Driven Answers in Medical VQA 文章

ArXiv CS.CV2026-06-02NEWSen作者: Xiaorong Zhu, Qiang Li, Zibo Xu, Weijie Wang, Weizhi Nie

摘要

arXiv:2606.01044v1 Announce Type: new Abstract: Medical visual question answering requires models to ground their responses in image evidence, because visually unsupported answers can mislead downstream interpretation. However, many medical VQA questions are generic, template-like, or highly similar in form, which can encourage models to learn question-answer shortcuts instead of image-dependent reasoning and thereby increase the risk of hallucinated responses. We propose Ask4VG, a label-free pilot framework for risk-aware question selection. Ask4VG estimates question-induced hallucination risk through counterfactual visual probing: the same question is asked under the original image, a perturbed image, a blank image, and a mismatched image, and the resulting answer relations are converted into weak supervision for a counterfactual risk estimator.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据