CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning · Event

PRODUCT_LAUNCH · 2026-05-27 · Impact: MEDIUM

arXiv:2605.26967v1. Announce Type: new.

Abstract: Existing video captioning methods struggle to balance visual fidelity and redundancy: holistic captions are compact but lose fine-grained evidence, whereas segment-wise captions improve coverage but introduce heavy redundancy. We propose CodecCap, a codec-inspired framework for high-fidelity dense video captioning. Analogous to video codecs, CodecCap represents vid

CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning · Related Technologies