Geometry-Aware Distillation for Prompt Tuning Biomedical Vision-Language Models 文章

ArXiv CS.CV2026-06-04NEWSen作者: Tran Dinh Tien, Zhiqiang Shen

详细信息

来源站点: ArXiv CS.CV
作者: Tran Dinh Tien, Zhiqiang Shen
文章类型: NEWS
语言: en
发布日期: 2026-06-04

摘要

arXiv:2606.04922v1 Announce Type: new Abstract: Current prompt-based and adapter-based tuning of vision-language models (VLMs) is attractive for medical imaging, where clinical data sensitivity favors frozen backbones and annotations are limited. However, these methods typically optimize only the ground-truth class, treating all other classes as equally incorrect, ignoring clinically meaningful class relations and yielding unstable decision boundaries in limited-supervision settings. We propose Omni-Geometry Knowledge Distillation (OGKD), a new framework that injects class-relation structure into the teacher to produce directional targets that preserve the ground truth while respecting inter-class geometry. Using these targets, we develop two distillation losses: Global Geometry-Aware Distillation (GAD) operates on the global image token, and Label-Guided Geometry Distillation (LGD) applies the same geometry to attentive patch tokens to improve fine-grained alignment.

Geometry-Aware Distillation for Prompt Tuning Biomedical Vision-Language Models 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (4)