Benchmarks for Vision-Language Models in Urban Perception Should Be Reliability-Aware and Negotiated 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Benchmarks for Vision-Language Models in Urban Perception Should Be Reliability-Aware and Negotiated arXiv:2606.00871v1 Announce Type: new Abstract: Vision-language models (VLMs) are increasingly used to generate structured descriptions of street-level imagery for tasks such as streetscape auditing, mapping, and public consultation. These uses combine observable attributes with appraisal categories, and the human targets are often distributions of judgments with disagreement and explicit non-re