Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval · Event

PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM

arXiv:2209.11572v3 · Announce Type: replace

Abstract: As an increasingly popular task in multimedia information retrieval, video moment retrieval (VMR) aims to localize the target moment from an untrimmed video according to a given language query. Most previous methods depend heavily on numerous manual annotations (i.e., moment boundaries), which are extremely expensive to acquire in practice. In addition, due to the domain gap

Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval · Related Technologies