Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding arXiv:2508.20478v2 Announce Type: replace Abstract: Long-form video understanding, characterized by long-range temporal dependencies and multiple events, remains a challenge. Existing methods often rely on static reasoning or external visual-language models (VLMs), which face issues like complexity and sub-optimal performance due to the lack of end-to-end training. In this paper, we propose Video-MTR, a reinforced multi-tur

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding · 相关公司

G
GLENONPROFIT
T
TERINONPROFIT
E
EnsionCOMPANY
F
FrameworkCOMPANY
I
IterRESEARCH_INSTITUTE
A
ANDINONPROFIT
T
TemporaRESEARCH_INSTITUTE
A
ACTNONPROFIT
C
CharacterNONPROFIT
I
iterativeCOMPANY