Denoising Application Performance Models with Noise-Resilient Priors
Denoising Application Performance Models with Noise-Resilient Priors
When scaling parallel codes to larger machines, performance models help identify potential bottlenecks. Since analytically designing these mathematical representations is usually challenging, empirical models based on performance measurements offer a practical alternative. Yet, measurements on HPC systems are typically affected by noise, leading to potentially misleading model predictions. To reduce the influence of noise, we introduce application-specific dynamic priors into the modeling process, which we derive from noise-resilient measurements of computational effort and knowledge of typical algorithms used in communication routines. These priors then narrow the search space for our performance models, excluding complexity classes that reflect noise rather than performance. Our approach keeps the models much closer to theoretical expectations and significantly improves their predictive power. Finally, it cuts experimental costs in half by minimizing the number of repeated measurements.
Gustavo de Morais、Alexander Gei?、Alexandru Calotoiu、Gregor Corbin、Ahmad Tarraf、Torsten Hoefler、Bernd Mohr、Felix Wolf
计算技术、计算机技术
Gustavo de Morais,Alexander Gei?,Alexandru Calotoiu,Gregor Corbin,Ahmad Tarraf,Torsten Hoefler,Bernd Mohr,Felix Wolf.Denoising Application Performance Models with Noise-Resilient Priors[EB/OL].(2025-04-15)[2025-04-30].https://arxiv.org/abs/2504.10996.点此复制
评论