Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents 文章

ArXiv CS.CL2026-06-05NEWSen作者: Nicholas Edwards, Sebastian Schuster

详细信息

来源站点: ArXiv CS.CL
作者: Nicholas Edwards, Sebastian Schuster
文章类型: NEWS
语言: en
发布日期: 2026-06-05

摘要

arXiv:2603.26233v2 Announce Type: replace Abstract: As Large Language Model (LLM) agents are increasingly deployed in open-ended domains like software engineering, they frequently encounter underspecified instructions that lack crucial context. While human developers naturally resolve underspecification by asking clarifying questions, current agents are largely optimized for autonomous execution. In this work, we systematically evaluate the clarification-seeking abilities of LLM agents on an underspecified variant of SWE-bench Verified. We propose an uncertainty-aware multi-agent scaffold that decouples underspecification detection from code execution. Across both proprietary and open-weight frontier LLMs, our scaffold achieves a 69.40% task resolve rate, significantly outperforming a standard single-agent setup and closing the performance gap with agents operating on fully specified instructions.

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (3)

相关技术查看全部 (3)