PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning 文章

ArXiv CS.CV2026-06-16NEWSen作者: Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka

详细信息

来源站点: ArXiv CS.CV
作者: Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka
文章类型: NEWS
语言: en
发布日期: 2026-06-16

摘要

arXiv:2512.10840v2 Announce Type: replace Abstract: 6D object pose estimation, which predicts the transformation of an object relative to the camera, remains challenging for unseen objects. Existing approaches typically rely on explicitly constructing feature correspondences between the query image and either the object model or template images. In this work, we propose PoseGAM, a geometry-aware multi-view framework that directly predicts object pose from a query image and multiple template images, eliminating the need for explicit matching. Built upon recent multi-view-based foundation model architectures, the method integrates object geometry information through two complementary mechanisms: explicit point-based geometry and learned features from geometry representation networks. In addition, we construct a large-scale synthetic dataset containing more than 190k objects under diverse environmental conditions to enhance robustness and generalization.

PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (2)

相关技术查看全部 (2)