The Surface You Test Is Not the Surface That Breaks 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

The Surface You Test Is Not the Surface That Breaks arXiv:2605.30454v1 Announce Type: cross Abstract: Tool-augmented LLM agents are vulnerable to prompt injection: a third party who controls part of the agent's context can plant instructions that the agent then executes as if they came from the user. Current evaluations report a single attack success rate per model on one channel, the tool output and treat that number as the model's vulnerability. But tool descriptions, which the agent reads at

The Surface You Test Is Not the Surface That Breaks · 相关产品