|国家预印本平台
首页|ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

来源:Arxiv_logoArxiv
英文摘要

Prior work synthesizes tool-use LLM datasets by first generating a user query, followed by complex tool-use annotations like DFS. This leads to inevitable annotation failures and low efficiency in data generation. We introduce ToolGrad, an agentic framework that inverts this paradigm. ToolGrad first constructs valid tool-use chains through an iterative process guided by textual "gradients", and then synthesizes corresponding user queries. This "answer-first" approach led to ToolGrad-5k, a dataset generated with more complex tool use, lower cost, and 100% pass rate. Experiments show that models trained on ToolGrad-5k outperform those on expensive baseline datasets and proprietary LLMs, even on OOD benchmarks.

Zhongyi Zhou、Kohei Uehara、Haoyu Zhang、Jingtao Zhou、Lin Gu、Ruofei Du、Zheng Xu、Tatsuya Harada

计算技术、计算机技术

Zhongyi Zhou,Kohei Uehara,Haoyu Zhang,Jingtao Zhou,Lin Gu,Ruofei Du,Zheng Xu,Tatsuya Harada.ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"[EB/OL].(2025-08-06)[2025-08-16].https://arxiv.org/abs/2508.04086.点此复制

评论