Voluntary Collusion with Secret Tools in Competing LLM Agents 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Voluntary Collusion with Secret Tools in Competing LLM Agents arXiv:2605.27593v1 Announce Type: new Abstract: Even when a tool is explicitly described as unfair and harmful to others, ostensibly safety-aligned LLM agents still voluntarily engage in secret collusion whenever doing so confers a strategic advantage. To investigate this phenomenon, we introduce an empirical framework built on two strategic multi-agent environments: Liar's Bar, a competitive deception scenario, and Cleanup, a mixed-

Voluntary Collusion with Secret Tools in Competing LLM Agents · 相关公司

F
FAIRRESEARCH_INSTITUTE
R
RonCOMPANY
A
arXivNONPROFIT
F
FrameworkCOMPANY
O
OLSNONPROFIT
A
ACTNONPROFIT
E
EGINONPROFIT
F
FINDNONPROFIT