Cross-modal linkage risk in clinical vision-language models 文章

ArXiv CS.CV2026-06-02NEWSen作者: Soroosh Tayebi Arasteh, Mahshad Lotfinia, Sven Nebelung, Daniel Truhn

详细信息

来源站点: ArXiv CS.CV
作者: Soroosh Tayebi Arasteh, Mahshad Lotfinia, Sven Nebelung, Daniel Truhn
文章类型: NEWS
语言: en
发布日期: 2026-06-02

摘要

arXiv:2606.02276v1 Announce Type: new Abstract: Vision-language models (VLMs) trained on paired chest radiographs and radiology reports learn a shared embedding space that can preserve instance-level image-report correspondence. This poses a privacy risk in settings where radiographs and reports are deliberately kept separate after acquisition, such as image-only data sharing or access-controlled reports, because a de-identified image may be re-linked to its original narrative report through cosine similarity alone. We formalized this as image-to-report retrieval and used public paired cohorts, in which the true pairing is known by design, as ground-truth benchmarks to audit the risk rather than as the privacy scenario.

Cross-modal linkage risk in clinical vision-language models 文章

详细信息

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (1)