VolFill: Single-View Amodal 3D Scene Reconstruction with Volumetric Flow Matching 文章

ArXiv CS.CV2026-06-01NEWSen作者: Tuan Duc Ngo, Chuang Gan, Evangelos Kalogerakis

摘要

arXiv:2605.31466v1 Announce Type: new Abstract: Reconstructing the complete geometry of a scene from a single RGB image remains challenging - especially when inferring hidden structures where visual evidence is incomplete. We introduce VolFill, a generative framework that predicts the 3D structure of the complete scene rather than relying on traditional pixel-aligned regression. Our method utilizes a hybrid 3D VAE to compress sparse truncated unsigned distance function grids into a compact latent space, paired with a latent Diffusion Transformer that denoises this representation to recover the complete scene. We condition the generation on geometry foundation models, leveraging rich spatial priors for robust reasoning. Unlike existing methods limited by per-ray constraints or unstructured point-cloud queries, VolFill provides a structured representation that supports direct surface extraction and occupancy queries at scale.

VolFill: Single-View Amodal 3D Scene Reconstruction with Volumetric Flow Matching 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (2)

相关技术查看全部 (4)