TimeSpot: Benchmarking Geo-Temporal Understanding in Vision-Language Models in Real-World Settings 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

TimeSpot: Benchmarking Geo-Temporal Understanding in Vision-Language Models in Real-World Settings arXiv:2603.06687v2 Announce Type: replace Abstract: Geo-temporal understanding, the ability to infer location, time, and contextual properties from visual input alone, underpins applications such as disaster management, traffic planning, embodied navigation, world modeling, and geography education. Although recent vision-language models (VLMs) have advanced image geo-localization using cues like l