SpongeBob: Sync-Aware Harmonious Audio-Visual Generative Editing

Event: PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2605.25193v1 Announce Type: new

Abstract: Visual and acoustic events in the physical world are inherently coupled, yet existing video editing methods typically adopt decoupled pipelines, lacking bidirectional modality interaction. This results in two key limitations: (i) audio-visual desynchronization and (ii) contextual conflicts between generated audio and preserved content. To address these, we propose SpongeBob, the firs