Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents? 文章

ArXiv CS.CL2026-05-28NEWSen作者: Yibo Zhao, Zichen Ding, Jiayi Wu, Zun Wang, Xiang Li

摘要

arXiv:2605.27881v1 Announce Type: new Abstract: Search agents powered by large language models can autonomously decompose queries, retrieve information, and synthesize answers through multi-step reasoning. However, the rapid growth of training methods has outpaced controlled comparison: existing works differ in retrieval corpora, reward designs, and training protocols, making it unclear what actually drives improvements. We present a controlled empirical study that isolates three under-explored dimensions of search agent training. First, we identify a critical data-coverage issue in the widely used Wikipedia 2018 corpus and show that correcting it alone yields larger gains than the differences between training algorithms.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据