Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval arXiv:2209.11572v3 Announce Type: replace Abstract: As an increasingly popular task in multimedia information retrieval, video moment retrieval (VMR) aims to localize the target moment from an untrimmed video according to a given language query. Most previous methods depend heavily on numerous manual annotations (i.e., moment boundaries), which are extremely expensive to acquire in practice. In addition, due to the domain gap