Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection 文章

ArXiv CS.CV2026-06-09NEWSen作者: Kaiqiang Li, Gang Li, Mingle Zhou, Min Li, Delong Han, Jin Wan

详细信息

来源站点
ArXiv CS.CV
作者
Kaiqiang Li, Gang Li, Mingle Zhou, Min Li, Delong Han, Jin Wan
文章类型
NEWS
语言
en
发布日期
2026-06-09

摘要

arXiv:2603.21511v2 Announce Type: replace Abstract: Zero-shot (ZS) 3D anomaly detection is crucial for reliable industrial inspection, as it enables detecting and localizing defects without requiring any target-category training data. Existing approaches render 3D point clouds into 2D images and leverage pre-trained Vision-Language Models (VLMs) for anomaly detection. However, such strategies inevitably discard geometric details and exhibit limited sensitivity to local anomalies. In this paper, we revisit intrinsic 3D representations and explore the potential of pre-trained Point-Language Models (PLMs) for ZS 3D anomaly detection. We propose BTP (Back To Point), a novel framework that effectively aligns 3D point cloud and textual embeddings. Specifically, BTP aligns multi-granularity patch features with textual representations for localized anomaly detection, while incorporating geometric descriptors to enhance sensitivity to structural anomalies.