Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems · Event

PRODUCT_LAUNCH · 2026-06-02 · Impact: MEDIUM

arXiv:2606.00925v1. Announce Type: cross.

Abstract: Open agent platforms allow community contributors to publish reusable skills that agents can invoke at runtime. This extensibility also creates a supply-chain risk: malicious contributors can hide harmful behavior inside skills that appear benign under superficial inspection. However, existing defenses are hard to evaluate because there is no benchmark that me
