ITA
ENG

S+: Efficient 2D sparse LU factorization on parallel machines

Authors

Shen, K Yang, T Jiao, XM

Citation

K. Shen et al., S+: Efficient 2D sparse LU factorization on parallel machines, SIAM J MATR, 22(1), 2000, pp. 282-305

Citations number

Categorie Soggetti

Mathematics

Journal title

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS

ISSN journal

08954798 → ACNP

Volume

Issue

Year of publication

2000

Pages

282 - 305

Database

ISI

SICI code

0895-4798(20000620)22:1<282:SE2SLF>2.0.ZU;2-F

Abstract

Static symbolic factorization coupled with supernode partitioning and async hronous computation scheduling can achieve high gigaflop rates for parallel sparse LU factorization with partial pivoting. This paper studies properti es of elimination forests and uses them to optimize supernode partitioning/ amalgamation and execution scheduling. It also proposes supernodal matrix m ultiplication to speed up kernel computation by retaining the BLAS-3 level efficiency and avoiding unnecessary arithmetic operations. The experiments show that our new design with proper space optimization, called S+, improve s our previous solution substantially and can achieve up to 10 GFLOPS on 12 8 Cray T3E 450MHz nodes.