Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models (paper)

2024, 288 citations
Multimodal Machine Learning Applications; Human Pose and Action Recognition; COVID-19 diagnosis using AI