|国家预印本平台
首页|Microscopic and collective signatures of feature learning in neural networks

Microscopic and collective signatures of feature learning in neural networks

Microscopic and collective signatures of feature learning in neural networks

来源:Arxiv_logoArxiv
英文摘要

Feature extraction - the ability to identify relevant properties of data - is a key factor underlying the success of deep learning. Yet, it has proved difficult to elucidate its nature within existing predictive theories, to the extent that there is no consensus on the very definition of feature learning. A promising hint in this direction comes from previous phenomenological observations of quasi-universal aspects in the training dynamics of neural networks, displayed by simple properties of feature geometry. We address this problem within a statistical-mechanics framework for Bayesian learning in one hidden layer neural networks with standard parameterization. Analytical computations in the proportional limit (when both the network width and the size of the training set are large) can quantify fingerprints of feature learning, both collective ones (related to manifold geometry) and microscopic ones (related to the weights). In particular, (i) the distance between different class manifolds in feature space is a nonmonotonic function of the temperature, which we interpret as the equilibrium counterpart of a phenomenon observed under gradient descent (GD) dynamics, and (ii) the microscopic learnable parameters in the network undergo a finite data-dependent displacement with respect to the infinite-width limit, and develop correlations. These results indicate that nontrivial feature learning is at play in a regime where the posterior predictive distribution is that of Gaussian process regression with a trivially rescaled prior.

Andrea Corti、Rosalba Pacelli、Pietro Rotondo、Marco Gherardi

计算技术、计算机技术

Andrea Corti,Rosalba Pacelli,Pietro Rotondo,Marco Gherardi.Microscopic and collective signatures of feature learning in neural networks[EB/OL].(2025-08-28)[2025-09-02].https://arxiv.org/abs/2508.20989.点此复制

评论