Hierarchical Local-Global Transformer for Temporal Sentence Grounding (Event)

PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2208.14882v2. Announce Type: replace-cross.

Abstract: This paper studies temporal sentence grounding (TSG), a multimedia task that aims to accurately localize the specific segment of an untrimmed video that matches a given sentence query. Traditional TSG methods mainly follow a top-down or bottom-up framework and are not end-to-end. They rely heavily on time-consuming post-processing to refine the groun