Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems arXiv:2606.00925v1 Announce Type: cross Abstract: Open agent platforms allow community contributors to publish reusable skills that agents can invoke at runtime. This extensibility also creates a supply-chain risk: malicious contributors can hide harmful behavior inside skills that appear benign under superficial inspection. However, existing defenses are hard to evaluate because there is no benchmark that me