PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models 文章

ArXiv CS.AI2026-06-01NEWSen作者: Ziliang Zhao, Zenan Xu, Shuting Wang, Hongjin Qian, Yan Lei, Minda Hu, Zhao Wang, Shihan Dou, Zhicheng Dou, Pluto Zhou

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models · 相关技术