Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
Event: PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM
arXiv:2209.11572v3 · Announce Type: replace
Abstract: As an increasingly popular task in multimedia information retrieval, video moment retrieval (VMR) aims to localize the target moment from an untrimmed video according to a given language query. Most previous methods depend heavily on numerous manual annotations (i.e., moment boundaries), which are extremely expensive to acquire in practice. In addition, due to the domain gap
Related coverage
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
ArXiv CS.CV · 2026-05-26