Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs arXiv:2606.01620v1 Announce Type: new Abstract: Video diffusion models have significantly advanced portrait video generation, yet their high computational demands limit their use in interactive applications. This work presents a framework for streamable talking portrait video generation conditioned on speech audio and reference images. Designed meticulously for streaming scenarios, it features