SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces arXiv:2605.12015v2 Announce Type: replace-cross Abstract: Reusable skills are becoming a common interface for extending large language model agents, packaging procedural guidance with access to files, tools, memory, and execution environments. However, this modularity introduces attack surfaces that are largely missed by existing safety evaluations: even when the user request is benign, unsafe influence may reside in s
相关产品查看全部 (10)
相关报道查看全部 (1)
SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces
ArXiv CS.CL2026-05-28