Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming arXiv:2605.21652v2 Announce Type: replace Abstract: Vision-Language Models (VLMs) have significantly advanced medical visual question answering, yet their performance in ultrasound remains suboptimal. In clinical practice, sonographers explicitly focus on lesion regions to formulate reports, though diagnostic interpretations sometimes vary due to inherent subjectivity. However, existing VLMs are not explicitly structu

Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming · 相关人物