Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems
Event type: PRODUCT_LAUNCH | Date: 2026-05-28 | Impact: MEDIUM
arXiv:2605.27766v1 (Announce Type: new)
Abstract: LLM safety evaluations predominantly test models in isolation, yet deployed AI agents increasingly operate within persistent social environments alongside other agents. We introduce a Moltbook-style simulation platform where thousands of LLM agents interact across communities over a simulated month, and use it to evaluate privacy as a downstream safety concern under
Source: ArXiv CS.AI, 2026-05-28