Jg. Kennedy et al., IMPLEMENTATION OF IMPLICIT FINITE-ELEMENT METHODS FOR INCOMPRESSIBLE FLOWS ON THE CM-5, Computer methods in applied mechanics and engineering, 119(1-2), 1994, pp. 95-111
A parallel implementation of an implicit finite element formulation fo
r incompressible fluids on a distributed-memory massively parallel com
puter is presented. The dominant issue that distinguishes the implemen
tation of finite element problems on distributed-memory computers from
that on traditional shared-memory scalar or vector computers is the d
istribution of data (and hence workload) to the processors and the non
-uniform memory hierarchy associated with the processors, particularly
the non-uniform costs associated with on-processor and off-processor
memory references. Accessing data stored in a remote processor require
s computing resources an order of magnitude greater than accessing dat
a locally in a processor. This distribution of data motivates the deve
lopment of alternatives to traditional algorithms and data structures
designed for shared-memory computers, which must now account for distr
ibuted-memory architectures. Data structures as well as data decomposi
tion and data communication algorithms designed for distributed-memory
computers are presented in the context of high level language constru
cts from High Performance Fortran. The discussion relies primarily on
abstract features of the hardware and software environment and should
be applicable, in principle, to a variety of distributed-memory system
. The actual implementation is carried out on a Connection Machine CM-
5 system with high performance communication functions.