首页|Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones

Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones

来源：

英文摘要

Inference-time computation has emerged as a promising scaling axis for improving large language model reasoning. However, despite yielding impressive performance, the optimal allocation of inference-time computation remains poorly understood. A central question is whether to prioritize sequential scaling (e.g., longer chains of thought) or parallel scaling (e.g., majority voting across multiple short chains of thought). In this work, we seek to illuminate the landscape of test-time scaling by demonstrating the existence of reasoning settings where sequential scaling offers an exponential advantage over parallel scaling. These settings are based on graph connectivity problems in challenging distributions of graphs. We validate our theoretical findings with comprehensive experiments across a range of language models, including models trained from scratch for graph connectivity with different chain of thought strategies as well as large reasoning models.

作者：Parsa Mirtaheri、Ezra Edelman、Samy Jelassi、Eran Malach、Enric Boix-Adsera

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Parsa Mirtaheri,Ezra Edelman,Samy Jelassi,Eran Malach,Enric Boix-Adsera.Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones[EB/OL].(2025-05-27)[2025-07-16].https://arxiv.org/abs/2505.21825.点此复制

Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones

Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones

评论