AgentLeak: A Benchmark for Internal-Channel Privacy Leakage in Multi-Agent LLM Systems 文章

ArXiv CS.AI2026-06-16NEWSen作者: Faouzi El Yagoubi, Godwin Badu-Marfo, Ranwa Al Mallah

详细信息

来源站点: ArXiv CS.AI
作者: Faouzi El Yagoubi, Godwin Badu-Marfo, Ranwa Al Mallah
文章类型: NEWS
语言: en
发布日期: 2026-06-16

摘要

arXiv:2602.11510v3 Announce Type: replace Abstract: Multi-agent Large Language Model (LLM) systems create privacy risks that current output-only benchmarks cannot measure. When agents coordinate on tasks, sensitive data may pass through inter-agent messages, shared memory, and tool arguments, all pathways that final-output audits typically do not inspect. We introduce AgentLeak, a benchmark for evaluating internal-channel privacy leakage in multi-agent LLM systems. AgentLeak instruments seven privacy-relevant communication pathways and provides a large-scale empirical evaluation focused on final outputs, inter-agent messages, and shared memory. Across 1,000 scenarios spanning healthcare, finance, legal, and corporate domains, five production LLMs (GPT-4o, GPT-4o-mini, Claude 3.5 Sonnet, Mistral Large, and Llama 3.3 70B), and 4,979 validated execution traces, we find that multi-agent configurations reduce final-output leakage (C1: 27.2% vs 43.

AgentLeak: A Benchmark for Internal-Channel Privacy Leakage in Multi-Agent LLM Systems 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (10)

相关技术查看全部 (1)