Geometry-Aware Distillation for Prompt Tuning Biomedical Vision-Language Models 文章

ArXiv CS.CV2026-06-04NEWSen作者: Tran Dinh Tien, Zhiqiang Shen

详细信息

来源站点
ArXiv CS.CV
作者
Tran Dinh Tien, Zhiqiang Shen
文章类型
NEWS
语言
en
发布日期
2026-06-04

摘要

arXiv:2606.04922v1 Announce Type: new Abstract: Current prompt-based and adapter-based tuning of vision-language models (VLMs) is attractive for medical imaging, where clinical data sensitivity favors frozen backbones and annotations are limited. However, these methods typically optimize only the ground-truth class, treating all other classes as equally incorrect, ignoring clinically meaningful class relations and yielding unstable decision boundaries in limited-supervision settings. We propose Omni-Geometry Knowledge Distillation (OGKD), a new framework that injects class-relation structure into the teacher to produce directional targets that preserve the ground truth while respecting inter-class geometry. Using these targets, we develop two distillation losses: Global Geometry-Aware Distillation (GAD) operates on the global image token, and Label-Guided Geometry Distillation (LGD) applies the same geometry to attentive patch tokens to improve fine-grained alignment.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据