AnomalyAgent: Training-Free Agentic Models for Zero-/Few-Shot Anomaly Detection

ArXiv CS.CV, 2026-05-29. Authors: Yi Zhang, Jiawen Zhu, Lele Fu, Guansong Pang

Abstract

arXiv:2605.30140v1. Benefiting from the generalizability of vision-language models (VLMs) such as CLIP, many zero-/few-shot anomaly detection (AD) approaches have achieved impressive detection performance across various datasets. Nevertheless, they require substantial training on large auxiliary datasets to adapt VLMs to anomaly detection, and their inference relies largely on anomaly scores derived from visual-text embedding similarity, so they lack the reasoning ability needed to detect complex anomalies that require in-depth contextual understanding. To address these limitations, we propose AnomalyAgent, a novel training-free, agentic framework that leverages the advanced reasoning and generalization capabilities of multimodal large language models (MLLMs) for anomaly detection.
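
For readers unfamiliar with the similarity-based scoring that the abstract contrasts against, the following is a minimal sketch of one common formulation. It is not AnomalyAgent and is not taken from the paper. It assumes an image embedding and two text embeddings (one for a "normal" prompt, one for an "anomalous" prompt) have already been produced by some VLM encoder; the function names, the toy vectors, and the temperature value are all illustrative assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def anomaly_score(image_emb, normal_text_emb, anomalous_text_emb, temperature=0.07):
    """Softmax probability that the image matches the 'anomalous' prompt.

    Hypothetical helper: the temperature and the two-prompt setup are
    assumptions made for illustration only.
    """
    s_normal = cosine(image_emb, normal_text_emb) / temperature
    s_anomalous = cosine(image_emb, anomalous_text_emb) / temperature
    # Subtract the max logit before exponentiating for numerical stability.
    m = max(s_normal, s_anomalous)
    e_normal = math.exp(s_normal - m)
    e_anomalous = math.exp(s_anomalous - m)
    return e_anomalous / (e_normal + e_anomalous)


# Toy embeddings (made up): the image aligns with the "normal" prompt,
# so the score should be close to 0.
score = anomaly_score([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
```

A score of this kind depends only on embedding geometry, which is the limitation the abstract points to: nothing in the computation reasons about context.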
