Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
This paper examines the challenges of test-time scaling in large language models (LLMs) with respect to both data and inference efficiency. Based on pilot studies, we highlight the diversity of multilingual reasoning and introduce a novel approach, \(L^2\) multilingual unification learning, together with a decoding intervention strategy for further investigation. The core idea of \(L^2\) is that the reasoning process varies across languages, and that these variations can be mutually beneficial for improving both model performance and efficiency. Specifically, we consider two types of multilingual data: entire long chain-of-thought annotations written in different languages, and step-wise mixtures of languages within a single chain. By further tuning on such data, we show that even small amounts can significantly improve reasoning capabilities. Our findings suggest that multilingual learning reduces both the required training data and the number of inference tokens while maintaining comparable performance. Furthermore, \(L^2\) is orthogonal to other data-efficient methods, which underscores the importance of diverse data selection. The \(L^2\) method offers a promising solution to the challenges of data collection and test-time compute efficiency in LLMs.
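The abstract distinguishes two kinds of multilingual fine-tuning data: whole chains-of-thought rendered in a single language, and chains whose individual steps mix languages. The sketch below is a minimal illustration of how such examples might be assembled; it is not the authors' code, and the `translate` helper, the language pool, and the `Example` structure are all assumptions for illustration only.

```python
# Minimal sketch (assumed, not from the paper): building the two kinds of
# multilingual chain-of-thought training examples described in the abstract.
from dataclasses import dataclass
from typing import List

LANGS = ["en", "zh", "fr", "de"]  # assumed language pool


def translate(text: str, lang: str) -> str:
    """Placeholder for a machine-translation call; the real system is unspecified."""
    return f"[{lang}] {text}"


@dataclass
class Example:
    question: str
    cot: str     # chain-of-thought used as the fine-tuning target
    answer: str


def whole_cot_variant(question: str, steps: List[str], answer: str, lang: str) -> Example:
    # Type 1: the entire long chain-of-thought rendered in one language.
    cot = "\n".join(translate(s, lang) for s in steps)
    return Example(question, cot, answer)


def stepwise_mixture_variant(question: str, steps: List[str], answer: str) -> Example:
    # Type 2: a step-wise mixture, cycling through languages step by step.
    cot = "\n".join(translate(s, LANGS[i % len(LANGS)]) for i, s in enumerate(steps))
    return Example(question, cot, answer)


if __name__ == "__main__":
    steps = ["Define x as the unknown.", "Set up 2x + 3 = 9.", "Solve: x = 3."]
    print(whole_cot_variant("Solve 2x + 3 = 9.", steps, "3", "zh").cot)
    print(stepwise_mixture_variant("Solve 2x + 3 = 9.", steps, "3").cot)
```

Either variant could then be used for standard supervised fine-tuning; which construction the paper actually applies, and how the decoding intervention interacts with it, is not specified in this abstract.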
Kang Chen, Mengdi Zhang, Yixin Cao
Linguistics
Kang Chen, Mengdi Zhang, Yixin Cao. Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs [EB/OL]. (2025-06-23) [2025-07-16]. https://arxiv.org/abs/2506.18341.