An accuracy-runtime trade-off comparison of scalable Gaussian process approximations for spatial data
An accuracy-runtime trade-off comparison of scalable Gaussian process approximations for spatial data
Gaussian processes (GPs) are flexible, probabilistic, non-parametric models widely employed in various fields such as spatial statistics and machine learning. A drawback of Gaussian processes is their computational cost having $\mathcal{O}(N^3)$ time and $\mathcal{O}(N^2)$ memory complexity which makes them prohibitive for large data sets. Numerous approximation techniques have been proposed to address this limitation. In this work, we systematically compare the accuracy of different Gaussian process approximations concerning likelihood evaluation, parameter estimation, and prediction taking into account the computational time required to perform these tasks. In other words, we analyze the trade-off between accuracy and runtime on multiple simulated and large-scale real-world data sets. We find that Vecchia approximations consistently emerge as the most accurate in almost all experiments.
Filippo Rambelli、Fabio Sigrist
自然科学研究方法
Filippo Rambelli,Fabio Sigrist.An accuracy-runtime trade-off comparison of scalable Gaussian process approximations for spatial data[EB/OL].(2025-06-24)[2025-07-16].https://arxiv.org/abs/2501.11448.点此复制
评论