LongCat-Video-Avatar 1.5 Technical Report (Event)

OPEN_SOURCE | 2026-05-27 | Impact: MEDIUM

arXiv:2605.26486v1 (Announce Type: new)

Abstract: Despite advances in audio-driven video generation, achieving commercial-grade stability remains challenging. We present LongCat-Video-Avatar 1.5, an upgraded open-source framework prioritizing systematic engineering and production-readiness over architectural novelty. By upgrading the audio encoder to Whisper Large and meticulously scaling our training recipes, v1.5 achieves accurate lip-synchronization, f