ORACLE-SWE: Quantifying the Contribution of Oracle Information Signals on SWE Agents 文章
摘要
arXiv:2604.07789v2 Announce Type: replace-cross Abstract: Recent advances in language model (LM) agents have significantly improved automated software engineering (SWE). Prior work has proposed various agentic workflows and training strategies as well as analyzed failure modes of agentic systems on SWE tasks, focusing on several contextual information signals: Reproduction Test, Regression Test, Edit Location, Execution Context, and API Usage. However, the individual contribution of each signal to overall success remains underexplored, particularly their ideal contribution when intermediate information is perfectly obtained. To address this gap, we introduce Oracle-SWE, a unified method to isolate and extract oracle information signals from SWE benchmarks and quantify the impact of each signal on agent performance.
相关事件查看全部 (1)
相关公司
暂无数据
相关人物
暂无数据