Thinking Isn't an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations

Source: arXiv
Abstract

Large Reasoning Models (LRMs) have become a central focus of current large language model (LLM) research. These models are designed to output a step-by-step thinking process before arriving at a final answer, in order to handle complex reasoning tasks. Despite their promise, recent empirical studies (e.g., [Shojaee et al., 2025] from Apple) suggest that this thinking process may not actually enhance reasoning ability: LLMs without explicit reasoning outperform LRMs on tasks with low or high complexity. In this work, we revisit these findings and investigate whether the limitations of LRMs persist when tool augmentations are introduced. We incorporate two types of tools, Python interpreters and scratchpads, and evaluate three representative LLMs and their LRM counterparts on Apple's benchmark reasoning puzzles. Our results show that, with proper tool use, LRMs consistently outperform their non-reasoning counterparts across all levels of task complexity. These findings challenge the recent narrative that reasoning is an illusion and highlight the potential of tool-augmented LRMs for solving complex problems.

Zhao Song, Song Yue, Jiahao Zhang

Computing technology, computer technology

Zhao Song, Song Yue, Jiahao Zhang. Thinking Isn't an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations [EB/OL]. (2025-07-23) [2025-08-10]. https://arxiv.org/abs/2507.17699.
