Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks (paper)

2020, Lecture Notes in Computer Science, cited by 1512
Multimodal Machine Learning Applications; Domain Adaptation and Few-Shot Learning; Advanced Image and Video Retrieval Techniques