Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems arXiv:2602.15198v2 Announce Type: replace-cross Abstract: Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a unique safety problem when a group of agents forms a coalition and colludes to pursue secondary goals and degrade the joint objective. In this paper, we present Colosseum, a framework for auditing LLM agents'