Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach

ArXiv CS.AI · 2026-06-01 · Authors: Chanwoo Park, Ziyang Chen, Asuman Ozdaglar, Kaiqing Zhang
