Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering 论文
2023IEEE Transactions on Pattern Analysis and Machine Intelligence引用 330
Multimodal Machine Learning ApplicationsHuman Pose and Action RecognitionAdvanced Image and Video Retrieval Techniques