Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations 论文
2017International Journal of Computer Vision引用 5145
Multimodal Machine Learning ApplicationsImage Retrieval and Classification TechniquesAdvanced Image and Video Retrieval Techniques