ELIMINATING BARRIER SYNCHRONIZATION FOR COMPILER-PARALLELIZED CODES ON SOFTWARE DSMS

Citation
Hs. Han et al., ELIMINATING BARRIER SYNCHRONIZATION FOR COMPILER-PARALLELIZED CODES ON SOFTWARE DSMS, International Journal of Parallel Programming, 26(5), 1998, pp. 591-612
Number of citations
27
Subject Categories
Computer Science Theory & Methods
ISSN journal
0885-7458
Volume
26
Issue
5
Year of publication
1998
Pages
591 - 612
Database
ISI
SICI code
0885-7458(1998)26:5<591:EBSFCC>2.0.ZU;2-O
Abstract
Software distributed-shared-memory (DSM) systems provide an appealing target for parallelizing compilers due to their flexibility. Previous studies demonstrate such systems can provide performance comparable to message-passing compilers for dense-matrix kernels. However, synchronization and load imbalance are significant sources of overhead. In this paper, we investigate the impact of compilation techniques for eliminating barrier synchronization overhead in software DSMs. Our compile-time barrier elimination algorithm extends previous techniques in three ways: (1) we perform inexpensive communication analysis through local subscript analysis when using chunk iteration partitioning for parallel loops; (2) we exploit delayed updates in lazy-release-consistency DSMs to eliminate barriers guarding only anti-dependences; (3) when possible we replace barriers with customized nearest-neighbor synchronization. Experiments on an IBM SP-2 indicate these techniques can improve parallel performance by 20% on average and by up to 60% for some applications.
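
Illustrative sketch
The idea behind technique (3) can be illustrated with a small shared-memory example. This is a minimal sketch and not the paper's algorithm or implementation: Python threads stand in for DSM processes, and the function name `stencil`, the `signal` table, and the three-point averaging loop are all hypothetical. It assumes chunk iteration partitioning of a loop whose iterations read only the immediately adjacent elements, so each worker depends only on the workers owning the neighboring chunks and can wait for those two instead of for everyone.

```python
import threading


def stencil(values, steps, workers, use_barrier):
    """Run `steps` three-point averaging sweeps over `values` using `workers` threads.

    Hypothetical illustration: with use_barrier=False, the global barrier after
    each sweep is replaced by pairwise signals between adjacent workers.
    """
    n = len(values)
    bufs = [list(values), list(values)]      # double buffer; endpoints stay fixed
    chunk = (n + workers - 1) // workers      # chunk iteration partitioning
    barrier = threading.Barrier(workers)
    # signal[(src, dst)]: sweeps that worker src has finished and dst has not yet waited for
    signal = {(s, d): threading.Semaphore(0)
              for s in range(workers)
              for d in (s - 1, s + 1) if 0 <= d < workers}

    def run(w):
        lo, hi = max(1, w * chunk), min(n - 1, (w + 1) * chunk)
        neighbors = [d for d in (w - 1, w + 1) if 0 <= d < workers]
        for t in range(steps):
            src, dst = bufs[t % 2], bufs[(t + 1) % 2]
            for i in range(lo, hi):
                # src[lo - 1] and src[hi] belong to the adjacent chunks
                dst[i] = (src[i - 1] + src[i] + src[i + 1]) / 3.0
            if use_barrier:
                barrier.wait()                # every worker waits for every other
            else:
                for d in neighbors:           # tell each neighbor this sweep is done
                    signal[(w, d)].release()
                for d in neighbors:           # wait only for the adjacent workers
                    signal[(d, w)].acquire()

    threads = [threading.Thread(target=run, args=(w,)) for w in range(workers)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return bufs[steps % 2]
```

Calling `stencil(data, 10, 4, True)` and `stencil(data, 10, 4, False)` on the same input should give identical results, since each element is computed from the same operands under either schedule. In the sketch a worker may run at most one sweep ahead of its neighbors, which is enough here because the sweep it would overwrite has already been read by them.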