Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents? 文章

ArXiv CS.CL2026-05-28NEWSen作者: Yibo Zhao, Zichen Ding, Jiayi Wu, Zun Wang, Xiang Li

摘要

arXiv:2605.27881v1 Announce Type: new Abstract: Search agents powered by large language models can autonomously decompose queries, retrieve information, and synthesize answers through multi-step reasoning. However, the rapid growth of training methods has outpaced controlled comparison: existing works differ in retrieval corpora, reward designs, and training protocols, making it unclear what actually drives improvements. We present a controlled empirical study that isolates three under-explored dimensions of search agent training. First, we identify a critical data-coverage issue in the widely used Wikipedia 2018 corpus and show that correcting it alone yields larger gains than the differences between training algorithms.

Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents? 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (2)

相关技术