An Empirical Study of Training End-to-End Vision-and-Language Transformers 论文

20222022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)引用 314
Multimodal Machine Learning ApplicationsDomain Adaptation and Few-Shot LearningAdvanced Neural Network Applications

An Empirical Study of Training End-to-End Vision-and-Language Transformers · 相关文章

暂无数据