PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning 文章

ArXiv CS.CV2026-06-16NEWSen作者: Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka

详细信息

来源站点
ArXiv CS.CV
作者
Jianqi Chen, Biao Zhang, Xiangjun Tang, Peter Wonka
文章类型
NEWS
语言
en
发布日期
2026-06-16

摘要

arXiv:2512.10840v2 Announce Type: replace Abstract: 6D object pose estimation, which predicts the transformation of an object relative to the camera, remains challenging for unseen objects. Existing approaches typically rely on explicitly constructing feature correspondences between the query image and either the object model or template images. In this work, we propose PoseGAM, a geometry-aware multi-view framework that directly predicts object pose from a query image and multiple template images, eliminating the need for explicit matching. Built upon recent multi-view-based foundation model architectures, the method integrates object geometry information through two complementary mechanisms: explicit point-based geometry and learned features from geometry representation networks. In addition, we construct a large-scale synthetic dataset containing more than 190k objects under diverse environmental conditions to enhance robustness and generalization.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据