Prompting Visual-Language Models for Efficient Video Understanding 论文

2022Lecture notes in computer science引用 346
Multimodal Machine Learning ApplicationsHuman Pose and Action RecognitionDomain Adaptation and Few-Shot Learning

Prompting Visual-Language Models for Efficient Video Understanding · 相关文章

暂无数据