A. Povitsky et Pj. Morris, A higher-order compact method in space and time based on parallel implementation of the Thomas algorithm, J COMPUT PH, 161(1), 2000, pp. 182-203
In this study we propose a novel method to parallelize high-order compact n
umerical algorithms for the solution of three-dimensional PDEs in a space-t
ime domain. For such a numerical integration most of the computer time is s
pent in computation of spatial derivatives at each stage of the Runge-Kutta
temporal update. The most efficient direct method to compute spatial deriv
atives on a serial computer is a version of Gaussian elimination for narrow
linear banded systems known as the Thomas algorithm. In a straightforward
pipelined implementation of the Thomas algorithm processors are idle due to
the forward and backward recurrences of the Thomas algorithm. To utilize p
rocessors during this time, we propose to use them for either nonlocal data
-independent computations, solving lines in the next spatial direction, or
local data-dependent computations by the Runge-Kutta method. To achieve thi
s goal, control of processor communication and computations by a static sch
edule is adopted. Thus, our parallel code is driven by a communication and
computation schedule instead of the usual "creative programming" approach.
The obtained parallelization speed-up of the novel algorithm is about twice
as much as that for the basic pipelined algorithm and close to that for th
e explicit DRP algorithm. Use of the algorithm is demonstrated and comparis
ons with other schemes are given. (C) 2000 Academic Press.