GMOS: Grounding Moving Object Segmentation in 3D Space and Time 文章

ArXiv CS.CV2026-05-29NEWSen作者: Junyu Xie, Tengda Han, Weidi Xie, Andrew Zisserman

摘要

arXiv:2605.30352v1 Announce Type: new Abstract: Moving Object Segmentation (MOS) aims to discover, segment, and track objects that move independently of the camera. Current MOS methods, however, exhibit two fundamental limitations: they rely on pre-computed 2D auxiliary modalities such as optical flow or point trajectories that lack 3D geometric information, and they treat motion as a sequence-level attribute, overlooking the instantaneous motion state of each object. We address both by grounding MOS in 3D space and time, and propose GMOS, a framework that operates directly on RGB video to produce 3D-aware, temporally fine-grained segmentation of multiple moving objects, alongside a foreground--background variant GMOS-S for faster deployment.

GMOS: Grounding Moving Object Segmentation in 3D Space and Time 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (6)

相关技术查看全部 (2)