Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning 文章

ArXiv CS.AI2026-05-26NEWSen作者: Shresth Verma, Mauricio Tec, Cheol Woo Kim, Kai Wang, Milind Tambe

Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning · 相关技术