Learning to Adapt SFT Data for Better Reasoning Generalization 文章

ArXiv CS.CL2026-05-27NEWSen作者: Lisong Sun, Li Wang, Chen Zhang, Jinyang Wu, Kui Zhang, Tianhao Peng, Wenjun Wu

Learning to Adapt SFT Data for Better Reasoning Generalization · 相关技术