Edge Prediction for Roof Wireframe Reconstruction with Transformers 文章

ArXiv CS.CV2026-06-02NEWSen作者: Gustav Hanning, Ludvig Dill\'en, Jonathan Astermark, Johanna Lidholm, Viktor Larsson

摘要

arXiv:2606.02406v1 Announce Type: new Abstract: This paper presents a competitive solution to the S23DR Challenge 2026, which aims to reconstruct 3D house roof wireframe models from sparse SfM point clouds and ground-level semantic segmentations and depth maps. Our proposed method utilizes an end-to-end Transformer encoder-decoder architecture inspired by DETR. To effectively process the geometric and semantic data, the sparse SfM point cloud input is dynamically subsampled based on semantic priority and augmented with Gestalt and ADE20k class features. To further increase segmentation context, we fuse the point features with additional Gestalt feature encodings which are obtained by projecting the points into latent feature maps produced by a frozen autoencoder. Learned query embeddings are then decoded directly into 3D wireframe edges via cross-attention mechanisms.

相关公司

暂无数据

相关人物

暂无数据