Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models
Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models
We propose a theoretical model called "information gravity" to describe the text generation process in large language models (LLMs). The model uses physical apparatus from field theory and spacetime geometry to formalize the interaction between user queries and the probability distribution of generated tokens. A query is viewed as an object with "information mass" that curves the semantic space of the model, creating gravitational potential wells that "attract" tokens during generation. This model offers a mechanism to explain several observed phenomena in LLM behavior, including hallucinations (emerging from low-density semantic voids), sensitivity to query formulation (due to semantic field curvature changes), and the influence of sampling temperature on output diversity.
Maryna Vyshnyvetska
计算技术、计算机技术
Maryna Vyshnyvetska.Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models[EB/OL].(2025-04-29)[2025-07-01].https://arxiv.org/abs/2504.20951.点此复制
评论