Evaluation of collective I/O implementations on parallel architectures

Citation
Pm. Dickens et R. Thakur, Evaluation of collective I/O implementations on parallel architectures, J PAR DISTR, 61(8), 2001, pp. 1052-1076
Citations number
26
Categorie Soggetti
Computer Science & Engineering
Journal title
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
ISSN journal
07437315 → ACNP
Volume
61
Issue
8
Year of publication
2001
Pages
1052 - 1076
Database
ISI
SICI code
0743-7315(200108)61:8<1052:EOCIIO>2.0.ZU;2-W
Abstract
In this paper, we evaluate the impact on performance of various implementat ion techniques for collective I/O operations, and we do so across four impo rtant parallel architectures. We show that a naive implementation of collec tive I/O does not result in significant performance gains for any of the ar chitectures, but that an optimized implementation does provide excellent pe rformance across till of the platforms under study. Furthermore, we demonst rate that there exists a single implementation strategy that provides the b est performance for all four computational platforms. Next, we evaluate imp lementation techniques for thread-based collective I/O operations. We show that the most obvious implementation technique, which is to spawn a thread to execute the whole collective I/O operation in the background. frequently provides the worst performance, often performing much worse than just exec uting the collective I/O routine entirely in the foreground. To improve per formance, we explore an alternate approach where part of the collective I/O operation is performed in the background, and part is performed in the for eground. We demonstrate that this implementation technique can provide sign ificant performance gains, offering up to a 50% improvement over implementa tions that do not attempt to overlap collective I/O and computation. (C) 20 01 Academic Press.