MemoVAD: Resource-Efficient Video Anomaly Detection via Dynamic Semantic Memory in Edge Computing Scenarios 文章

ArXiv CS.CV2026-06-09NEWSen作者: Guo Li, Jiandian Zeng, Yang Li, Zihao Peng, Ke Chen, Tian Wang

详细信息

来源站点
ArXiv CS.CV
作者
Guo Li, Jiandian Zeng, Yang Li, Zihao Peng, Ke Chen, Tian Wang
文章类型
NEWS
语言
en
发布日期
2026-06-09

摘要

arXiv:2606.07669v1 Announce Type: new Abstract: Deploying Video Anomaly Detection (VAD) in real-world surveillance faces a fundamental tension between the demand for high-level semantics to ensure effectiveness and the limited computational resources of edge devices. Vision-Language Models (VLMs) provide rich open-vocabulary semantics, but their latency and computational cost preclude on-device deployment. To address the challenge, we propose MemoVAD, an edge-cloud collaborative framework that selectively incorporates VLM semantics into streaming VAD. MemoVAD runs most inference on the edge with a lightweight detector and a causal Temporal Context Encoder (TCE) to model temporal dependencies. Specifically, we introduce an Uncertainty-Aware Gating (UAG) policy grounded in Subjective Logic to model perceived uncertainty and query the cloud-based VLM only for high-uncertainty and semantically novel clips.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据