Parallel nested dissection |
| |
Authors: | John M Conroy |
| |
Affiliation: | Supercomputing Research Center, 17100 Science Drive, Bowie, Maryland 20715-4300, USA |
| |
Abstract: | Nested dissection is a very popular direct method for solving sparse linear systems that arise from finite difference and finite element methods. Worley and Schreiber 16] give a fine grain algorithm for a square array of processors. Their algorithm uses O(N2) processors, each with O(N) memory, to factor an N2 by N2 sparse matrix whose graphs is an N × N mesh. The efficiency of their method is between 1/46 and 1/12. George et al. 6] 8] give a medium grain algorithm for hypercube architecture, while George et al. 7] give an algorithm for shared memory machines. These papers present a column oriented approach which can exploit O(N) parallelism and yield efficiencies up to 50%. Lucas 11] also gives a column oriented scheme which achieves up to 75% efficiency and O(N) parallelism. In this paper, we present a medium to fine grain algorithm for a P × P array of processors with local memory. This algorithm can exploit up to O(N2) parallelism. The efficiency of the fine grain version is comparable to 16] while as a medium grain algorithm achieves about 49% efficiency. The strength of the method is due to three factors: its ability to pipeline much of the computation, overlapping computation and communication, and the use of level 3 BLAS like primitives. In addition to its high efficiency its memory requirement is optimal, only O(N2 log N/P2) words memory is needed per processor. |
| |
Keywords: | Nested dissection Sparse linear systems solver Block algorithm Cholesky factorization Efficiency analysis |
本文献已被 ScienceDirect 等数据库收录! |
|