ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use 事件
SHUTDOWN2026-06-02影 响: LOW
ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use arXiv:2606.00341v1 Announce Type: cross Abstract: As AI agents are increasingly deployed in real personal and corporate settings (email accounts, development workflows, company databases, etc.), safety considerations surrounding these agents become paramount. Although much work has focused on agent safety in the presence of an adversary, we show that agents can exhibit misaligned behavior even in benign settings, taking unsafe
相关产品查看全部 (10)
相关报道查看全部 (1)
ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use
ArXiv CS.AI2026-06-02