ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients" 文章

ArXiv CS.CL2026-06-18NEWSen作者: Zhongyi Zhou, Kohei Uehara, Haoyu Zhang, Jingtao Zhou, Lin Gu, Ruofei Du, Zheng Xu, Tatsuya Harada

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.CL
作者: Zhongyi Zhou, Kohei Uehara, Haoyu Zhang, Jingtao Zhou, Lin Gu, Ruofei Du, Zheng Xu, Tatsuya Harada
文章类型: NEWS
语言: en
发布日期: 2026-06-18

原文

摘要

arXiv:2508.04086v3 Announce Type: replace Abstract: Prior work synthesizes tool-use LLM datasets by first generating a user query, followed by complex tool-use annotations like depth-first search (DFS). This leads to inevitable annotation failures and low efficiency in data generation. We introduce ToolGrad, an agentic framework that inverts this paradigm. ToolGrad first constructs valid tool-use chains through an iterative process guided by textual "gradients", and then synthesizes corresponding user queries. This "answer-first" approach led to ToolGrad-500, a dataset generated with more complex tool use, lower cost, and almost 100% pass rate. Experiments show that ToolGrad models outperform those trained on expensive baseline datasets and proprietary LLMs. The ToolGrad source code, dataset, and models are available at https://github.com/zhongyi-zhou/toolgrad.

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients" 文章

详细信息

摘要

相关事件

相关公司

相关人物查看全部 (1)

相关产品查看全部 (9)

相关技术查看全部 (3)