Rethinking LLM Training through Information Geometry and Quantum Metrics

Source: arXiv
Abstract

Optimization in large language models (LLMs) unfolds over high-dimensional parameter spaces with non-Euclidean structure. Information geometry frames this landscape using the Fisher information metric, enabling more principled learning via natural gradient descent. Though often impractical, this geometric lens clarifies phenomena such as sharp minima, generalization, and observed scaling laws. We argue that curvature-aware approaches deepen our understanding of LLM training. Finally, we speculate on quantum analogies based on the Fubini-Study metric and Quantum Fisher Information, hinting at efficient optimization in quantum-enhanced systems.

Riccardo Di Sipio

Subjects: Mathematics; Physics; Computing Technology, Computer Technology

Riccardo Di Sipio. Rethinking LLM Training through Information Geometry and Quantum Metrics [EB/OL]. (2025-07-02) [2025-07-09]. https://arxiv.org/abs/2506.15830.
