SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces 事件

SHUTDOWN2026-05-28影响: LOW

SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces arXiv:2605.12015v2 Announce Type: replace-cross Abstract: Reusable skills are becoming a common interface for extending large language model agents, packaging procedural guidance with access to files, tools, memory, and execution environments. However, this modularity introduces attack surfaces that are largely missed by existing safety evaluations: even when the user request is benign, unsafe influence may reside in s