MVOFormer: Flow-Semantic Transformer for Robust Monocular Visual Odometry 文章

ArXiv CS.CV2026-06-16NEWSen作者: Jituo Li, Shunwang Sun, Jialu Zhang, Xinqi Liu, Jinyao Hu, Zhicheng Lu, Sajad Saeedi, Guodong Lu

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.CV
作者: Jituo Li, Shunwang Sun, Jialu Zhang, Xinqi Liu, Jinyao Hu, Zhicheng Lu, Sajad Saeedi, Guodong Lu
文章类型: NEWS
语言: en
发布日期: 2026-06-16

原文

摘要

arXiv:2606.16474v1 Announce Type: new Abstract: Monocular visual odometry (MVO) is foundational to autonomous navigation and robotic localization. However, existing learning-based MVO approaches often struggle with either a lack of interpretable, complementary features or overly complex multi-stage architectures. These limitations inherently restrict their robustness and cross-domain generalization. In this work, we propose MVOFormer, a novel transformer framework for robust monocular visual odometry. Our architecture features a Flow-Semantic Dual Branch Encoder that synergizes dense geometric motion cues with object-centric semantic priors, explicitly distinguishing static structures from dynamic distractors. These representations are then fused by an Iterative Multimodal Decoder, enabling coarse-to-fine pose refinement while dynamically suppressing attention on unreliable regions.

MVOFormer: Flow-Semantic Transformer for Robust Monocular Visual Odometry 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (5)

相关技术查看全部 (4)