More Skills, Worse Agents? Skill Shadowing Degrades Performance When Expanding Skill Libraries 文章

ArXiv CS.AI2026-05-26NEWSen作者: Hongwen Song (Vinson), Song (Vinson), Wei

摘要

arXiv:2605.24050v1 Announce Type: cross Abstract: Skill libraries allow LLM agents to load task-specific instructions on demand, letting non-expert users solve domain-specific tasks through natural language without knowing which skills exist or how they work. However, performance degrades as libraries grow -- by up to 21\% when scaling from a small set of helpful skills to a 202-skill library. In this work, we formulate this performance degradation as the pass rate drop between loading a library of known-helpful skills and the full library. Moreover, we propose to decompose the pass rate drop by conditioning on the skill(s) invocation -- which skills the agent selects during a trajectory -- into two effects: \emph{skill shadowing}, where the agent selects wrong skills more often as the library expands, and \emph{context overhead}, where the enlarged context degrades execution even when selection is correct.