HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents 文章

ArXiv CS.AI2026-06-01NEWSen作者: Jiangweizhi Peng, Yuanxin Liu, Ruida Zhou, Charles Fleming, Zhaoran Wang, Alfredo Garcia, Mingyi Hong

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents · 相关人物

暂无数据