MonoScene: Monocular 3D Semantic Scene Completion 论文

20222022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)引用 253

Advanced Vision and ImagingComputer Graphics and Visualization Techniques3D Shape Modeling and Analysis

Advanced Vision and Imaging Computer Graphics and Visualization Techniques 3D Shape Modeling and Analysis

作者

摘要

MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense geometry and semantics of a scene are inferred from a single monocular RGB image. Different from the SSC literature, relying on 2.5 or 3D input, we solve the complex problem of 2D to 3D scene reconstruction while jointly inferring its semantics. Our framework relies on successive 2D and 3D UNets, bridged by a novel 2D-3D features projection inspired by optics, and introduces a 3D context relation prior to enforce spatio-semantic consistency. Along with architectural contributions, we introduce novel global scene and local frustums losses. Experiments show we outperform the literature on all metries and datasets while hallucinating plausible scenery even beyond the camera field of view. Our code and trained models are available at https://github.com/cv-rits/MonoScene.

作者查看全部 (2)

Raoul de Charette

Anh-Quan Cao

MonoScene: Monocular 3D Semantic Scene Completion 论文

摘要

作者查看全部 (2)

相关技术查看全部 (1)

相关事件

相关文章