A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching 论文

2013引用 294
Multimodal Machine Learning ApplicationsAdvanced Image and Video Retrieval TechniquesVideo Analysis and Summarization

A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching · 相关技术