SPICE: Semantic Propositional Image Caption Evaluation (paper)

2016, Lecture Notes in Computer Science. Citations: 1918
Topics: Multimodal Machine Learning Applications; Human Pose and Action Recognition; Video Analysis and Summarization