Scalable Data-Driven Basis Selection for Linear Machine Learning Interatomic Potentials
Scalable Data-Driven Basis Selection for Linear Machine Learning Interatomic Potentials
Machine learning interatomic potentials (MLIPs) provide an effective approach for accurately and efficiently modeling atomic interactions, expanding the capabilities of atomistic simulations to complex systems. However, a priori feature selection leads to high complexity, which can be detrimental to both computational cost and generalization, resulting in a need for hyperparameter tuning. We demonstrate the benefits of active set algorithms for automated data-driven feature selection. The proposed methods are implemented within the Atomic Cluster Expansion (ACE) framework. Computational tests conducted on a variety of benchmark datasets indicate that sparse ACE models consistently enhance computational efficiency, generalization accuracy and interpretability over dense ACE models. An added benefit of the proposed algorithms is that they produce entire paths of models with varying cost/accuracy ratio.
Tina Torabi、Matthias Militzer、Michael P. Friedlander、Christoph Ortner
计算技术、计算机技术物理学
Tina Torabi,Matthias Militzer,Michael P. Friedlander,Christoph Ortner.Scalable Data-Driven Basis Selection for Linear Machine Learning Interatomic Potentials[EB/OL].(2025-04-23)[2025-06-29].https://arxiv.org/abs/2504.16418.点此复制
评论