JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions

ArXiv CS.CV | 2026-06-02 | Authors: Jiashuo Yu, Yao Yao, Boyu Chen, Alex Wang

Abstract

arXiv:2606.01703v1 (cross-listed)

We address the challenge of generating high-fidelity, long-form soundtracks that remain coherent across scene transitions. Existing AI music systems are mainly designed for short, isolated clips and lack mechanisms to ensure narrative continuity. We present JenBridge, a modular and interpretable framework for adaptive long-form video soundtracking that ensures both high-fidelity audio generation and transition naturalness. The core architecture is a Transformer-based generative model trained with a flow-matching objective, following a two-stage paradigm: pretraining on large-scale text-audio corpora to establish robust musical priors, then adapting to the video domain with dual text-visual conditioning for precise cross-modal alignment. Crucially, to achieve long-form coherence across diverse scene changes, JenBridge incorporates a novel adaptive transition mechanism.
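
The abstract names a flow-matching objective but does not say which probability path or loss weighting JenBridge uses. As a minimal sketch only, assuming the common straight-line path between Gaussian noise and a data sample, the objective regresses a predicted velocity onto the difference between the data sample and the noise. Every name below (`flow_matching_pair`, `flow_matching_loss`, `toy_velocity`, `cond`) is hypothetical and chosen for illustration; none comes from the paper, and the toy velocity function merely stands in for the Transformer.

```python
import random


def flow_matching_pair(x0, x1, t):
    """Return (x_t, target_velocity) on the straight line from x0 to x1.

    Assumption: linear interpolation path x_t = (1 - t) * x0 + t * x1,
    whose velocity is the constant x1 - x0. The paper may use another path.
    """
    x_t = [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]
    target = [b - a for a, b in zip(x0, x1)]
    return x_t, target


def flow_matching_loss(velocity_fn, x1, cond, rng):
    """Single-sample estimate of the flow-matching regression loss."""
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # noise sample, same size as x1
    t = rng.random()                        # time drawn uniformly from [0, 1)
    x_t, target = flow_matching_pair(x0, x1, t)
    pred = velocity_fn(x_t, t, cond)        # model's predicted velocity
    # Mean squared error between predicted and target velocity.
    return sum((p - v) ** 2 for p, v in zip(pred, target)) / len(x1)


def toy_velocity(x_t, t, cond):
    """Hypothetical stand-in for the conditioned Transformer: predicts zero."""
    return [0.0 for _ in x_t]


if __name__ == "__main__":
    rng = random.Random(0)
    # cond would carry text and visual conditioning; it is unused in this toy.
    loss = flow_matching_loss(toy_velocity, [1.0, -1.0], cond=None, rng=rng)
    print(f"toy flow-matching loss: {loss:.4f}")
```

In a real training loop the loss would be averaged over a batch and minimized with respect to the model parameters; the `cond` argument marks where the text and visual conditioning described in the abstract would enter.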
