A FRAMEWORK FOR GENERATING DISTRIBUTED-MEMORY PARALLEL PROGRAMS FOR BLOCK RECURSIVE ALGORITHMS

Citation
Sks. Gupta et al., A FRAMEWORK FOR GENERATING DISTRIBUTED-MEMORY PARALLEL PROGRAMS FOR BLOCK RECURSIVE ALGORITHMS, Journal of parallel and distributed computing, 34(2), 1996, pp. 137-153
Citations number
24
Categorie Soggetti
Computer Sciences","Computer Science Theory & Methods
ISSN journal
07437315
Volume
34
Issue
2
Year of publication
1996
Pages
137 - 153
Database
ISI
SICI code
0743-7315(1996)34:2<137:AFFGDP>2.0.ZU;2-2
Abstract
A framework for synthesizing communication-efficient distributed-memor y parallel programs for block recursive algorithms such as the fast Fo urier transform (FFT) and Strassen's matrix multiplication is presente d, This framework is based on an algebraic representation of the algor ithms, which involves the tensor (Kronecker) product and other matrix operations, This representation is useful in analyzing the communicati on implications of computation partitioning and data distributions, Th e programs are synthesized under two different target program models. These two models are based on different ways of managing the distribut ion of data for optimizing communication, The first model uses point-t o-point interprocessor communication primitives, whereas the second mo del uses data redistribution primitives involving collective all-to-ma ny communication. These two program models are shown to be suitable fo r different ranges of problem size, The methodology is illustrated by synthesizing communication-efficient programs for the FFT, This framew ork has been incorporated into the EXTENT system for automatic generat ion of parallel/vector programs for block recursive algorithms. (C) 19 96 Academic Press, Inc.