Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

Source: arXiv

Abstract

We propose a theoretical model called "information gravity" to describe the text generation process in large language models (LLMs). The model uses physical apparatus from field theory and spacetime geometry to formalize the interaction between user queries and the probability distribution of generated tokens. A query is viewed as an object with "information mass" that curves the semantic space of the model, creating gravitational potential wells that "attract" tokens during generation. This model offers a mechanism to explain several observed phenomena in LLM behavior, including hallucinations (emerging from low-density semantic voids), sensitivity to query formulation (due to semantic field curvature changes), and the influence of sampling temperature on output diversity.

Author: Maryna Vyshnyvetska

DOI: 10.5281/zenodo.15289890

Subject classification: Computing technology, computer technology

Recommended citation: Maryna Vyshnyvetska. Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models [EB/OL]. (2025-04-29) [2025-07-01]. https://arxiv.org/abs/2504.20951.
