Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training 论文

2020引用 239
Multimodal Machine Learning ApplicationsDomain Adaptation and Few-Shot LearningAdvanced Image and Video Retrieval Techniques

Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training · 相关技术