S+: Efficient 2D sparse LU factorization on parallel machines

Citation
K. Shen et al., S+: Efficient 2D sparse LU factorization on parallel machines, SIAM J MATR, 22(1), 2000, pp. 282-305
Number of citations
29
Subject categories
Mathematics
Journal title
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS
ISSN journal
0895-4798
Volume
22
Issue
1
Year of publication
2000
Pages
282 - 305
Database
ISI
SICI code
0895-4798(20000620)22:1<282:SE2SLF>2.0.ZU;2-F
Abstract
Static symbolic factorization coupled with supernode partitioning and asynchronous computation scheduling can achieve high gigaflop rates for parallel sparse LU factorization with partial pivoting. This paper studies properties of elimination forests and uses them to optimize supernode partitioning/amalgamation and execution scheduling. It also proposes supernodal matrix multiplication to speed up kernel computation by retaining the BLAS-3 level efficiency and avoiding unnecessary arithmetic operations. The experiments show that our new design with proper space optimization, called S+, improves our previous solution substantially and can achieve up to 10 GFLOPS on 128 Cray T3E 450 MHz nodes.
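
Illustrative note (not part of the original record): the abstract relies on supernode partitioning, which groups adjacent columns with matching nonzero structure so they can be processed as dense blocks with BLAS-3 kernels. The sketch below uses the common textbook criterion for merging consecutive columns of a lower-triangular factor. It is not taken from the paper, and it leaves out the amalgamation and elimination-forest optimizations the paper studies. The function name `partition_supernodes` and the input layout `col_struct` are assumptions made for this example.

```python
def partition_supernodes(col_struct):
    """Group consecutive columns of L whose sparsity patterns nest.

    col_struct[j] is assumed to be the set of row indices i > j that hold
    a nonzero in column j of the symbolically predicted L factor.
    Returns a list of (first_col, last_col) inclusive column ranges.
    """
    n = len(col_struct)
    supernodes = []
    start = 0
    for j in range(1, n):
        prev = col_struct[j - 1]
        # Column j extends the current supernode only if column j-1 has a
        # nonzero in row j and, apart from that row, the same pattern.
        same_block = (j in prev) and (prev - {j} == col_struct[j])
        if not same_block:
            supernodes.append((start, j - 1))
            start = j
    if n > 0:
        supernodes.append((start, n - 1))
    return supernodes


# Hypothetical 5-column pattern: columns 0-2 nest, column 3 starts a new block.
example = [{1, 2, 4}, {2, 4}, {4}, {4}, set()]
result = partition_supernodes(example)
```

For this hypothetical pattern, columns 0 to 2 form one supernode and columns 3 to 4 form another. Column 2 has no nonzero in row 3, so the dense diagonal block cannot extend past it.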