Hijacking Agent Memory: Stealthy Trojan Attacks Through Conversational Interaction 文章

ArXiv CS.AI2026-05-29NEWSen作者: Hongtao Wang, Se Yang, Yu Chen, Puzhuo Liu

摘要

arXiv:2605.29960v1 Announce Type: cross Abstract: Large language model (LLM) agents increasingly leverage long term memory to support persistent and autonomous task execution. However, this capability also introduces a new attack surface: memory poisoning, where adversaries can inject malicious information to influence future behavior. Existing memory poisoning attacks often assume that injected content can be stored directly in memory, overlooking the selective extraction and rewriting stages in modern memory pipelines. This makes prior methods ineffective under realistic settings. In this paper, we propose MemPoison, a novel memory poisoning attack that bypasses selective memory mechanisms in LLM agents, where an attacker can inject triggerable backdoors into the agent's long-term memory through dialogue interactions, thereby misleading its subsequent responses.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据