首页|Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

来源：

Arxiv

英文摘要

Mixture-of-Experts (MoE) architectures within Large Reasoning Models (LRMs) have achieved impressive reasoning capabilities by selectively activating experts to facilitate structured cognitive processes. Despite notable advances, existing reasoning models often suffer from cognitive inefficiencies like overthinking and underthinking. To address these limitations, we introduce a novel inference-time steering methodology called Reinforcing Cognitive Experts (RICE), designed to improve reasoning performance without additional training or complex heuristics. Leveraging normalized Pointwise Mutual Information (nPMI), we systematically identify specialized experts, termed ''cognitive experts'' that orchestrate meta-level reasoning operations characterized by tokens like ''<think>''. Empirical evaluations with leading MoE-based LRMs (DeepSeek-R1 and Qwen3-235B) on rigorous quantitative and scientific reasoning benchmarks demonstrate noticeable and consistent improvements in reasoning accuracy, cognitive efficiency, and cross-domain generalization. Crucially, our lightweight approach substantially outperforms prevalent reasoning-steering techniques, such as prompt design and decoding constraints, while preserving the model's general instruction-following skills. These results highlight reinforcing cognitive experts as a promising, practical, and interpretable direction to enhance cognitive efficiency within advanced reasoning models.

作者：Tian Liang、Qiuzhi Liu、Jiahao Xu、Mengru Wang、Xingyu Chen、Yue Wang、Ruotian Ma、Haitao Mi、Zhiwei He、Yunzhi Yao、Wenxuan Wang、Ningyu Zhang、Xiaolong Li、Dong Yu、Zhaopeng Tu

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Tian Liang,Qiuzhi Liu,Jiahao Xu,Mengru Wang,Xingyu Chen,Yue Wang,Ruotian Ma,Haitao Mi,Zhiwei He,Yunzhi Yao,Wenxuan Wang,Ningyu Zhang,Xiaolong Li,Dong Yu,Zhaopeng Tu.Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training[EB/OL].(2025-05-20)[2025-07-02].https://arxiv.org/abs/2505.14681.点此复制

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

评论