Towards Open-World Referring Expression Comprehension: A Benchmark with Training-free Multi-task Consistency Checker (Event)

PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2605.25706v1. Announce Type: new. Abstract: Referring expression comprehension (REC) aims to localize a target object within an image based on a given expression. Although recent advances in vision-language models have substantially improved performance on REC tasks, current REC benchmarks often feature simple scenarios and assume that each expression maps to a unique objec