GMOS: Grounding Moving Object Segmentation in 3D Space and Time

arXiv cs.CV, 2026-05-29
Authors: Junyu Xie, Tengda Han, Weidi Xie, Andrew Zisserman

Abstract

arXiv:2605.30352v1

Moving Object Segmentation (MOS) aims to discover, segment, and track objects that move independently of the camera. Current MOS methods, however, exhibit two fundamental limitations: they rely on pre-computed 2D auxiliary modalities, such as optical flow or point trajectories, that lack 3D geometric information; and they treat motion as a sequence-level attribute, overlooking the instantaneous motion state of each object. We address both by grounding MOS in 3D space and time, and propose GMOS, a framework that operates directly on RGB video to produce 3D-aware, temporally fine-grained segmentation of multiple moving objects, alongside a foreground-background variant, GMOS-S, for faster deployment.
