The Surface You Test Is Not the Surface That Breaks 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

The Surface You Test Is Not the Surface That Breaks arXiv:2605.30454v1 Announce Type: cross Abstract: Tool-augmented LLM agents are vulnerable to prompt injection: a third party who controls part of the agent's context can plant instructions that the agent then executes as if they came from the user. Current evaluations report a single attack success rate per model on one channel, the tool output and treat that number as the model's vulnerability. But tool descriptions, which the agent reads at