|国家预印本平台
首页|Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models

来源:Arxiv_logoArxiv
英文摘要

We propose a theoretical model called "information gravity" to describe the text generation process in large language models (LLMs). The model uses physical apparatus from field theory and spacetime geometry to formalize the interaction between user queries and the probability distribution of generated tokens. A query is viewed as an object with "information mass" that curves the semantic space of the model, creating gravitational potential wells that "attract" tokens during generation. This model offers a mechanism to explain several observed phenomena in LLM behavior, including hallucinations (emerging from low-density semantic voids), sensitivity to query formulation (due to semantic field curvature changes), and the influence of sampling temperature on output diversity.

Maryna Vyshnyvetska

10.5281/zenodo.15289890

计算技术、计算机技术

Maryna Vyshnyvetska.Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models[EB/OL].(2025-04-29)[2025-07-01].https://arxiv.org/abs/2504.20951.点此复制

评论