An Open-Source Benchmark and Baseline for Multi-temporal Referring Segmentation (Event)

Name: An Open-Source Benchmark and Baseline for Multi-temporal Referring Segmentation
Start: 2026-06-02

PRODUCT_LAUNCH | 2026-06-02 | Impact: MEDIUM

arXiv:2606.00987v1. Abstract: Large Vision-Language Models (LVLMs) have shown strong visual understanding and language-guided grounding abilities, yet their capacity for multi-temporal visual reasoning remains underexplored. To bridge this gap, we introduce Multi-temporal Referring Segmentation (MTRS), a new task that aims to segment language-described temporal changes from multi-temporal

Artificial Intelligence
