UR$^2$: Unify RAG and Reasoning through Reinforcement Learning 文章

ArXiv CS.CL2026-06-03NEWSen作者: Weitao Li, Boran Xiang, Xiaolong Wang, Zhinan Gou, Weizhi Ma, Yang Liu

UR$^2$: Unify RAG and Reasoning through Reinforcement Learning · 相关技术