A higher-order compact method in space and time based on parallel implementation of the Thomas algorithm

Citation
A. Povitsky et Pj. Morris, A higher-order compact method in space and time based on parallel implementation of the Thomas algorithm, J COMPUT PH, 161(1), 2000, pp. 182-203
Citations number
21
Categorie Soggetti
Physics
Journal title
JOURNAL OF COMPUTATIONAL PHYSICS
ISSN journal
00219991 → ACNP
Volume
161
Issue
1
Year of publication
2000
Pages
182 - 203
Database
ISI
SICI code
0021-9991(20000610)161:1<182:AHCMIS>2.0.ZU;2-4
Abstract
In this study we propose a novel method to parallelize high-order compact n umerical algorithms for the solution of three-dimensional PDEs in a space-t ime domain. For such a numerical integration most of the computer time is s pent in computation of spatial derivatives at each stage of the Runge-Kutta temporal update. The most efficient direct method to compute spatial deriv atives on a serial computer is a version of Gaussian elimination for narrow linear banded systems known as the Thomas algorithm. In a straightforward pipelined implementation of the Thomas algorithm processors are idle due to the forward and backward recurrences of the Thomas algorithm. To utilize p rocessors during this time, we propose to use them for either nonlocal data -independent computations, solving lines in the next spatial direction, or local data-dependent computations by the Runge-Kutta method. To achieve thi s goal, control of processor communication and computations by a static sch edule is adopted. Thus, our parallel code is driven by a communication and computation schedule instead of the usual "creative programming" approach. The obtained parallelization speed-up of the novel algorithm is about twice as much as that for the basic pipelined algorithm and close to that for th e explicit DRP algorithm. Use of the algorithm is demonstrated and comparis ons with other schemes are given. (C) 2000 Academic Press.