Resolving Ambiguity in Composed Image Retrieval via Calibrated Interaction 文章

ArXiv CS.CV2026-05-26NEWSen作者: Amsisan Tran, Baogh Le, Tuan Kiet Pham, Sui Yang Guang

摘要

arXiv:2605.24634v1 Announce Type: new Abstract: Composed image retrieval (CIR) searches a corpus with a reference image and a text describing how to modify it. Despite rapid progress from triplet-trained compositors to zero-shot and generative methods, essentially all systems share one assumption: that a query maps to a single target, scored by Recall@K against one annotation. We argue this is fundamentally at odds with the task. A query such as make it more formal does not name an image but a region of the corpus, and which member the user intends is genuinely underdetermined. This underspecification is the root of the well-known false-negative problem and leaves current models unable to tell a precise query from an ambiguous one. We reframe CIR as calibrated intent resolution under uncertainty: a retriever is wrapped in a conformal prediction layer that returns a candidate set with a coverage guarantee and whose size is a principled measure of ambiguity;

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据