Adaptive Information Control for Search-Augmented LLM Reasoning 文章

ArXiv CS.CL2026-06-04NEWSen作者: Siheng Xiong, Oguzhan Gungordu, James C. Kerce, Faramarz Fekri

摘要

arXiv:2602.01672v2 Announce Type: replace Abstract: Search-augmented reasoning agents interleave multi-step reasoning with external retrieval, but uncontrolled retrieval can introduce redundant evidence, saturate the context, and destabilize reinforcement learning (RL). Existing outcome-based RL methods provide only sparse terminal rewards, offering limited guidance for intermediate information-acquisition decisions. We propose DeepControl, an adaptive information-control framework based on information utility, a state-dependent estimate of the marginal value of retrieved evidence. The framework regulates information acquisition along two axes: extent, i.e., whether retrieval should continue, and resolution, i.e., how much retrieved detail should be exposed. It implements these controls through retrieval-continuation guidance, hierarchical granularity control, and an annealed control-forcing scheme.

相关公司

暂无数据

相关人物

暂无数据