Parallelizing the Approximate Minimum Degree Ordering Algorithm: Strategies and Evaluation
Parallelizing the Approximate Minimum Degree Ordering Algorithm: Strategies and Evaluation
The approximate minimum degree algorithm is widely used before numerical factorization to reduce fill-in for sparse matrices. While considerable attention has been given to the numerical factorization process, less focus has been placed on parallelizing the approximate minimum degree algorithm itself. In this paper, we explore different parallelization strategies, and introduce a novel parallel framework that leverages multiple elimination on distance-2 independent sets. Our evaluation shows that parallelism within individual elimination steps is limited due to low computational workload and significant memory contention. In contrast, our proposed framework overcomes these challenges by parallelizing the work across elimination steps. To the best of our knowledge, our implementation is the first scalable shared memory implementation of the approximate minimum degree algorithm. Experimental results show that we achieve up to an 8.30x speedup using 64 threads over the state-of-the-art sequential implementation in SuiteSparse.
Yen-Hsiang Chang、Ayd?n Bulu?、James Demmel
计算技术、计算机技术
Yen-Hsiang Chang,Ayd?n Bulu?,James Demmel.Parallelizing the Approximate Minimum Degree Ordering Algorithm: Strategies and Evaluation[EB/OL].(2025-04-23)[2025-07-16].https://arxiv.org/abs/2504.17097.点此复制
评论