Entangled Transformer for Image Captioning 论文

2019引用 387
Multimodal Machine Learning ApplicationsAdvanced Image and Video Retrieval TechniquesHuman Pose and Action Recognition

Entangled Transformer for Image Captioning · 相关技术