Prompting Visual-Language Models for Efficient Video Understanding 论文
2022Lecture notes in computer science引用 346
Multimodal Machine Learning ApplicationsHuman Pose and Action RecognitionDomain Adaptation and Few-Shot Learning
Prompting Visual-Language Models for Efficient Video Understanding · 相关文章
暂无数据