Consistent Yet Wrong: Evidence Insensitivity in Spatial Vision-Language Models 事件
PRODUCT_LAUNCH2026-06-03影响: MEDIUM
Consistent Yet Wrong: Evidence Insensitivity in Spatial Vision-Language Models arXiv:2606.02742v1 Announce Type: new Abstract: Spatial reasoning is fundamental to robotics, autonomy, and embodied AI, yet modern vision-language models (VLMs) remain unreliable on metric distance queries. A common assumption is that consistent predictions across viewpoints reflect geometric grounding. We test this assumption and find the opposite: leading VLMs often produce view-invariant and consistent answers ev
相关产品查看全部 (10)
相关报道查看全部 (1)
Consistent Yet Wrong: Evidence Insensitivity in Spatial Vision-Language Models
ArXiv CS.CV2026-06-03