MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation arXiv:2605.20183v2 Announce Type: replace Abstract: Video generation is rapidly evolving from single-shot synthesis to complex multi-shot audio-video (MSAV) narratives to meet real-world demands. However, evaluating such frontier models remains a fundamental challenge. Existing benchmarks are limited in scope and data diversity, and rely on rigid evaluation pipelines, preventing systematic and reliable