ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use 事件
SHUTDOWN2026-06-02影响: LOW
ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use arXiv:2606.00341v1 Announce Type: cross Abstract: As AI agents are increasingly deployed in real personal and corporate settings (email accounts, development workflows, company databases, etc.), safety considerations surrounding these agents become paramount. Although much work has focused on agent safety in the presence of an adversary, we show that agents can exhibit misaligned behavior even in benign settings, taking unsafe