The multi-modal fusion in visual question answering: a review of attention mechanisms · Paper

2023 · PeerJ Computer Science · Citations: 467 · Top venue
Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Technologies in Various Fields

The multi-modal fusion in visual question answering: a review of attention mechanisms · Authors