UCPO: Uncertainty-Aware Policy Optimization

arXiv cs.AI · 2026-05-27
Authors: Xianzhou Zeng, Jing Huang, Chunmei Xie, Gongrui Nan, Siye Chen, Mengyu Lu, Weiqi Xiong, Qixuan Zhou, Junhao Zhang, Qiang Zhu, Yadong Li, Xingzhong Xu

UCPO: Uncertainty-Aware Policy Optimization · Related Technologies