From Model Scaling to System Scaling: Scaling the Harness in Agentic AI 文章

ArXiv CS.AI2026-05-26NEWSen作者: Shangding Gu

摘要

arXiv:2605.26112v1 Announce Type: new Abstract: This paper studies the next major bottleneck in agentic AI as system scaling, not only model scaling: the design of auditable, persistent, modular, and verifiable architectures around foundation models. We refer to this shift as scaling the harness: treating the structured execution layer around a foundation model as a first-class object of design, evaluation, and optimization. Although recent large language models enable agents to use tools, retrieve information, maintain memory, and execute long-horizon workflows, evaluation remains largely model-centric, often reducing agents to final-task success while treating memory, retrieval, tool use, orchestration, verification, and governance as secondary implementation details.