Detect in Any Scene: An Agentic Framework for Object Detection with Experience-Aware Reasoning 文章

ArXiv CS.CV2026-06-01NEWSen作者: Wenlun Zhang, Jun Yin, Kentaro Yoshioka

摘要

arXiv:2605.31174v1 Announce Type: new Abstract: Object detection in real-world scenarios remains challenging due to diverse image degradations and heterogeneous object distributions, which significantly hinder the generalization of existing detectors. Conventional approaches, including scene-specific representation learning and end-to-end pipeline design, are inherently limited by their reliance on predefined conditions and lack adaptability to dynamic environments. In this paper, we propose DetAS, an agentic detection framework that formulates object detection as a dynamic decision process. Instead of relying on static pipelines, DetAS leverages a Multimodal Large Language Model (MLLM) as a central agent to adaptively compose detection workflows by selecting from a toolbox of restoration modules and specialized detectors.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据