Coding with "Enemy": Can Human Developers Detect AI Agent Sabotage? 事件
PRODUCT_LAUNCH2026-06-05影响: MEDIUM
Coding with "Enemy": Can Human Developers Detect AI Agent Sabotage? arXiv:2606.05647v1 Announce Type: cross Abstract: AI coding agents are increasingly embedded in real-world software development, collaborating with human developers while gaining broader access to codebases and tools. This creates a new attack surface: an agent can exploit human trust to sabotage development, for instance by inserting malicious code to accomplish a hidden side task. Most prior work studies AI sabotage in AI-onl