A general parallel sparse-blocked matrix multiply for linear scaling SCF theory

Authors
Citation
M. Challacombe, A general parallel sparse-blocked matrix multiply for linear scaling SCF theory, COMP PHYS C, 128(1-2), 2000, pp. 93-107
Citations number
102
Categorie Soggetti
Physics
Journal title
COMPUTER PHYSICS COMMUNICATIONS
ISSN journal
00104655 → ACNP
Volume
128
Issue
1-2
Year of publication
2000
Pages
93 - 107
Database
ISI
SICI code
0010-4655(200006)128:1-2<93:AGPSMM>2.0.ZU;2-D
Abstract
A general approach to the parallel sparse-blocked matrix-matrix multiply is developed in the context of linear scaling self-consistent-field (SCF) the ory. The data-parallel message passing method uses non-blocking communicati on to overlap computation and communication. The space filling curve heuris tic is used to achieve data locality for sparse matrix elements that decay with "separation". Load balance is achieved by solving the bin packing prob lem for blocks with variable size. With this new method as the kernel, parallel performance of the simplified density matrix minimization (SDMM) for solution of the SCF equations is inv estigated for RHF/6-31G** water clusters and RHF/3-21G estane globules. Sus tained rates above 5.7 GFLOPS for the SDMM have been achieved for (H2O)(200 ) with 95 Origin 2000 processors. Scalability is found to be limited by loa d imbalance, which increases with decreasing granularity, due primarily to the inhomogeneous distribution of variable block sizes. Published by Elsevi er Science B.V.