Generalizable Vision-Language Few-Shot Adaptation with Predictive Prompts and Negative Learning 文章

ArXiv CS.CV2026-05-26NEWSen作者: Sriram Mandalika

摘要

arXiv:2505.11758v2 Announce Type: replace Abstract: Few-shot adaptation of vision-language models remains fundamentally limited by how negative class signals are handled at inference. Existing methods apply uniform negative suppression across all queries, ignoring that the most damaging confusions are query-specific and shift with support-set geometry. We introduce SCAN (Selective Confusion-Aware Negatives), a framework that addresses this gap through three targeted contributions. In inference, query-adaptive negative routing restricts suppression to the top-K most confusable classes per query, requiring zero additional parameters. Generic negative text templates are replaced with LLM-bootstrapped contrastive prompts that describe discriminative attributes between confusable class pairs, sharpening the textual decision boundary where it matters most.

Generalizable Vision-Language Few-Shot Adaptation with Predictive Prompts and Negative Learning 文章

摘要

相关事件查看全部 (1)

相关公司查看全部 (4)

相关人物

相关产品查看全部 (5)

相关技术查看全部 (31)