VTI-CoT: Visual-Textual Interleaved Chain of Thought for Video Reasoning 文章

ArXiv CS.CV2026-06-05NEWSen作者: Shufan Zhang, Ziyue Lin, Bairun Wang, Lei Jin, Xuanding Ding, Xinzhu Ma, Kunlin Yang

VTI-CoT: Visual-Textual Interleaved Chain of Thought for Video Reasoning · 相关技术