Adaptive Preference Optimization with Uncertainty-aware Utility Anchor · Article

ArXiv CS.CL · 2026-05-26 · NEWS · en · Authors: Xiaobo Wang, Zixia Jia, Jiaqi Li, Qi Liu, Zilong Zheng

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor · Related Techniques