Clarification Is Not Enough: Post-Clarification Answering Remains the Bottleneck in Multi-Turn QA

ArXiv CS.CL | 2026-05-26 | Authors: Jinyan Su, Jennifer Healey

Abstract

arXiv:2605.25204v1. Pluralistic alignment requires systems to adapt to diverse user values, communication styles, and contextual assumptions. We believe that a foundational prerequisite for such alignment is enabling accurate preference elicitation from people when their intent is under-specified or ambiguous. We study the problem of preference elicitation in multi-turn question answering by decomposing it into two components: a clarification policy, which decides whether to ask a clarifying question or answer directly, and post-clarification answering, which produces the correct final answer once the missing information is provided. We show, using the PACIFIC benchmark, that supervised fine-tuning rapidly improves the clarification policy; however, final answer accuracy remains substantially lower even when the model takes the correct action.
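The two-part decomposition in the abstract can be illustrated with a minimal sketch that scores the components separately. This is not the authors' evaluation code: the `Example` record, its field names, the "clarify"/"answer" labels, and the `decompose_accuracy` helper are all hypothetical, and the toy data in the usage note is made up for illustration only.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Example:
    # All fields are hypothetical; the abstract does not specify a data schema.
    gold_action: str      # assumed labels: "clarify" or "answer"
    pred_action: str      # action the model actually took
    answer_correct: bool  # whether the final answer matched the reference


def decompose_accuracy(examples: List[Example]) -> Tuple[float, float]:
    """Return (action accuracy, answer accuracy given a correct action)."""
    if not examples:
        return 0.0, 0.0
    # Component 1: did the clarification policy pick the right action?
    right_action = [e for e in examples if e.pred_action == e.gold_action]
    action_acc = len(right_action) / len(examples)
    if not right_action:
        return action_acc, 0.0
    # Component 2: among correct actions only, was the final answer right?
    answer_acc = sum(e.answer_correct for e in right_action) / len(right_action)
    return action_acc, answer_acc
```

Called on a toy list where three of four examples take the right action but only one of those three ends with a correct answer, the function reports a high action accuracy next to a much lower conditional answer accuracy, which is the kind of gap the abstract describes.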
