Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models
Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models
Universal machine learning interatomic potentials (uMLIPs) are reshaping atomistic simulation as foundation models, delivering near \textit{ab initio} accuracy at a fraction of the cost. Yet the lack of reliable, general uncertainty quantification limits their safe, wide-scale use. Here we introduce a unified, scalable uncertainty metric \(U\) based on a heterogeneous model ensemble with reuse of pretrained uMLIPs. Across chemically and structurally diverse datasets, \(U\) shows a strong correlation with the true prediction errors and provides a robust ranking of configuration-level risk. Leveraging this metric, we propose an uncertainty-aware model distillation framework to produce system-specific potentials: for W, an accuracy comparable to full-DFT training is achieved using only \(4\%\) of the DFT labels; for MoNbTaW, no additional DFT calculations are required. Notably, by filtering numerical label noise, the distilled models can, in some cases, surpass the accuracy of the DFT reference labels. The uncertainty-aware approach offers a practical monitor of uMLIP reliability in deployment, and guides data selection and fine-tuning strategies, thereby advancing the construction and safe use of foundation models and enabling cost-efficient development of accurate, system-specific potentials.
Kai Liu、Zixiong Wei、Wei Gao、Poulumi Dey、Marcel H. F. Sluiter、Fei Shuang
计算技术、计算机技术化学物理学
Kai Liu,Zixiong Wei,Wei Gao,Poulumi Dey,Marcel H. F. Sluiter,Fei Shuang.Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models[EB/OL].(2025-07-28)[2025-08-11].https://arxiv.org/abs/2507.21297.点此复制
评论