REAL AND COMPLEX FAST FOURIER-TRANSFORMS ON THE FUJITSU VPP-500

Authors
M. Hegland
Citation
M. Hegland, REAL AND COMPLEX FAST FOURIER-TRANSFORMS ON THE FUJITSU VPP-500, Parallel computing, 22(4), 1996, pp. 539-553
Number of citations
17
Subject Categories
Computer Sciences, Computer Science Theory & Methods
Journal title
Parallel Computing
Journal ISSN
0167-8191
Volume
22
Issue
4
Year of publication
1996
Pages
539 - 553
Database
ISI
SICI code
0167-8191(1996)22:4<539:RACFFO>2.0.ZU;2-O
Abstract
Fast Fourier transforms parallelize well but need large amounts of communication. An algorithm which concentrates all the communication in one or two transposition steps is the transpose split algorithm. Different transposition algorithms can be used depending on data size and communication latency. A new transpose split algorithm for real and Hermitian data is presented for one-, two- and three-dimensional transforms. This algorithm is implemented on the Fujitsu VPP 500. The Fujitsu VPP 500 is a parallel processor with a moderate number of very fast vector processors connected by a crossbar switch. Each processor has a peak performance of 1.6 Gflop/s and can simultaneously read and write 400 MByte/s. Very long vector length, stride-one implementations of multiple FFTs on one node, as described by the author in 1994, are combined with optimized transpositions. One third of peak performance was achieved on a configuration with up to 32 processors.
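
The transpose split idea mentioned in the abstract can be illustrated with a minimal single-process sketch. It is not the paper's implementation: the function names (dft, transpose, transpose_split_fft) and the factorization n = n1*n2 are assumptions made for illustration, a naive DFT stands in for the optimized node-local FFTs, and the sketch covers complex input only, not the real or Hermitian variant. The point it shows is that all data movement is confined to explicit transpositions, while every transform in between works on contiguous rows. On a distributed machine, which of these transpositions actually require communication depends on the data layout.

```python
import cmath


def dft(v):
    """Naive O(n^2) DFT; a stand-in (assumption) for a fast node-local FFT."""
    n = len(v)
    return [
        sum(v[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def transpose(m):
    """Matrix transpose; in a parallel setting this is where data would move."""
    return [list(col) for col in zip(*m)]


def transpose_split_fft(x, n1, n2):
    """Length n1*n2 DFT of x built from short row transforms and transposes."""
    n = n1 * n2
    assert len(x) == n
    # View x as an n2-by-n1 matrix: a[j2][j1] = x[j1 + n1*j2].
    a = [x[j2 * n1:(j2 + 1) * n1] for j2 in range(n2)]
    # Transpose so each row is one stride-n1 subsequence, then transform rows.
    b = [dft(row) for row in transpose(a)]  # b[j1][k2]
    # Twiddle factors exp(-2*pi*i*j1*k2/n) couple the two stages.
    b = [
        [b[j1][k2] * cmath.exp(-2j * cmath.pi * j1 * k2 / n) for k2 in range(n2)]
        for j1 in range(n1)
    ]
    # Transpose again and transform the length-n1 rows.
    c = [dft(row) for row in transpose(b)]  # c[k2][k1]
    # X[k2 + n2*k1] = c[k2][k1]; a final transpose yields natural order.
    return [val for row in transpose(c) for val in row]
```

Calling transpose_split_fft(x, 3, 4) on a length-12 sequence should agree with dft(x) up to rounding error, which is a convenient check when experimenting with other factorizations.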