|国家预印本平台
首页|A Statistical Physics of Language Model Reasoning

A Statistical Physics of Language Model Reasoning

A Statistical Physics of Language Model Reasoning

来源:Arxiv_logoArxiv
英文摘要

Transformer LMs show emergent reasoning that resists mechanistic understanding. We offer a statistical physics framework for continuous-time chain-of-thought reasoning dynamics. We model sentence-level hidden state trajectories as a stochastic dynamical system on a lower-dimensional manifold. This drift-diffusion system uses latent regime switching to capture diverse reasoning phases, including misaligned states or failures. Empirical trajectories (8 models, 7 benchmarks) show a rank-40 projection (balancing variance capture and feasibility) explains ~50% variance. We find four latent reasoning regimes. An SLDS model is formulated and validated to capture these features. The framework enables low-cost reasoning simulation, offering tools to study and predict critical transitions like misaligned states or other LM failures.

Jack David Carson、Amir Reisizadeh

计算技术、计算机技术物理学

Jack David Carson,Amir Reisizadeh.A Statistical Physics of Language Model Reasoning[EB/OL].(2025-06-04)[2025-06-18].https://arxiv.org/abs/2506.04374.点此复制

评论