GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training 文章

ArXiv CS.AI2026-05-27NEWSen作者: Yuelin Hu, Zhenbo Yu, Zhengxue Cheng, Wei Liu, Li Song

GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training · 相关事件

相关事件