VGGSounder: Audio-Visual Evaluations for Foundation Models 文章

ArXiv CS.CV2026-06-04NEWSen作者: Daniil Zverev, Thadd\"aus Wiedemer, Ameya Prabhu, Matthias Bethge, Wieland Brendel, A. Sophia Koepke

VGGSounder: Audio-Visual Evaluations for Foundation Models · 相关技术