CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning (paper)

2022, Neurocomputing, cited by 668
Multimodal Machine Learning Applications; Advanced Image and Video Retrieval Techniques; Video Analysis and Summarization