Uni-RCM: Unified Reference-guided Cross-modal Mapping for Multi-Class Anomaly Detection 文章

ArXiv CS.CV2026-05-29NEWSen作者: Yangchen Wu, Huiqiang Xie

摘要

arXiv:2605.29455v1 Announce Type: new Abstract: Multi-modal industrial anomaly detection typically relies on separate models for each product category, fundamentally limiting practical scalability. When shifting to a unified paradigm that handles diverse classes simultaneously, detection accuracy often degrades due to inter-class interference and feature manifold confusion. To overcome these challenges, we propose a Unified Reference guided Cross-modal Mapping framework, named Uni-RCM. At its core, we propose a reference guide block to dynamically filter out category-specific noise by introducing a learnable reference feature, which captures the commonalities across different modalities. Besides, an offline residual quantizer is proposed to characterize the normal distribution by multiple cascaded codebooks.