Would you still call this Dax? Novel Visual References in VLMs and Humans 事件

PRODUCT_LAUNCH2026-06-05影响: MEDIUM

Would you still call this Dax? Novel Visual References in VLMs and Humans arXiv:2606.05409v1 Announce Type: new Abstract: Vision-language models (VLMs), like human learners, are frequently exposed to new visual concepts, but how they map novel visual references to language after exposure remains largely underexplored, particularly when those references contradict prior knowledge from pre-training. To study this, we present the Novel Visual References Dataset (NVRD): 19,176 images spanning 90 vi