TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation
TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation
Conducting supervised fine-tuning and preference fine-tuning on large language models (LLMs) requires high-quality datasets to improve their ability to follow instructions and align with human preferences and values. However, constructing such datasets is resource-intensive, and most available datasets for supervised and preference fine-tuning are in English. To address these challenges, we propose the \underline{\textbf{Ta}}xonomy-Guided \underline{\textbf{P}}reference Data Generation (TaP) framework, which facilitates automated and scalable construction of preference datasets across various languages. TaP is grounded in a structured taxonomy that allows fine-grained control over dataset composition, thereby ensuring both diversity and comprehensive coverage. We employ TaP-generated datasets to perform supervised and preference fine-tuning on various LLMs. Experimental results demonstrate that LLMs trained on TaP-generated datasets outperform those trained on existing open-source datasets. Remarkably, LLMs trained on TaP-generated datasets surpass the performance of those trained on an open-source dataset that is 180 times larger.
Deyi Xiong、Renren Jin、Tianhao Shen、Xinwei Wu、Dan Shi、Haoran Sun、Wuwei Huang、Quandong Wang、Wei Liu、Jian Luan、Bin Wang
计算技术、计算机技术
Deyi Xiong,Renren Jin,Tianhao Shen,Xinwei Wu,Dan Shi,Haoran Sun,Wuwei Huang,Quandong Wang,Wei Liu,Jian Luan,Bin Wang.TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation[EB/OL].(2025-06-30)[2025-07-16].https://arxiv.org/abs/2506.23979.点此复制
评论