Temporal2Seq: A Unified Framework for Temporal Video Understanding Tasks 事件

PRODUCT_LAUNCH2026-06-11影响: MEDIUM

Temporal2Seq: A Unified Framework for Temporal Video Understanding Tasks arXiv:2409.18478v2 Announce Type: replace Abstract: With the development of video understanding, there is a proliferation of tasks for clip-level temporal video analysis, including temporal action detection (TAD), temporal action segmentation (TAS), and generic event boundary detection (GEBD). While task-specific video understanding models have exhibited outstanding performance in each task, there remains a dearth of a uni

Temporal2Seq: A Unified Framework for Temporal Video Understanding Tasks · 相关人物