Understanding-Enhanced Model Collaboration for Long-Tailed Egocentric Mistake Detection 文章

ArXiv CS.CV2026-06-02NEWSen作者: Boyu Han, Qianqian Xu, Shilong Bao, Zhiyong Yang, Ruochen Cui, Qingming Huang

摘要

arXiv:2606.02120v1 Announce Type: new Abstract: In this report, we address the problem of determining whether a user performs an action incorrectly from egocentric video data. To this end, we propose an Understanding-Enhanced Model Collaboration Method (UE-MCM) that combines efficient coarse-grained video understanding with accurate fine-grained action reasoning. Specifically, UE-MCM contains a small model branch and a large model branch. The large model branch focuses on whether the fine-grained action itself is executed incorrectly, while the small model branch jointly takes the coarse-grained video and fine-grained segment as input to identify actions that may be locally correct but inconsistent with the overall workflow.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据