Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling 文章

ArXiv CS.AI2026-06-01NEWSen作者: Caleb Winston, Ron Yifeng Wang, Azalia Mirhoseini, Christos Kozyrakis

摘要

arXiv:2605.21470v2 Announce Type: replace-cross Abstract: Computer-use agents (CUAs) automate tasks specified with natural language such as "order the cheapest item from Taco Bell" by generating sequences of calls to tools such as click, type, and scroll on a browser. Current implementations follow a sequential fetch-screenshot-execute loop where each iteration requires an LLM call, resulting in high latency and frequent errors from incorrect tool use. We present agent just-in-time (JIT) compilation, a system that compiles task descriptions directly into executable code that may include LLM calls, tool calls, and parallelization. Our approach comprises three components: (1) JIT-Planner, which generates multiple code plans, validates each against tool specifications, and selects the minimum-cost candidate; (2) JIT-Scheduler, which explores parallelization strategies via Monte Carlo cost estimation from learned latency distributions;