The impact of exploiting instruction-level parallelism on shared-memory multiprocessors

Citation
Vs. Pai et al., The impact of exploiting instruction-level parallelism on shared-memory multiprocessors, IEEE COMPUT, 48(2), 1999, pp. 218-226
Citations number
14
Categorie Soggetti
Computer Science & Engineering
Journal title
IEEE TRANSACTIONS ON COMPUTERS
ISSN journal
00189340 → ACNP
Volume
48
Issue
2
Year of publication
1999
Pages
218 - 226
Database
ISI
SICI code
0018-9340(199902)48:2<218:TIOEIP>2.0.ZU;2-L
Abstract
Current microprocessors incorporate techniques to aggressively exploit inst ruction-level parallelism (ILP). This paper evaluates the impact of such pr ocessors on the performance of shared-memory multiprocessors, both without and with the latency-hiding optimization of software prefetching. Our resul ts show that, while ILP techniques substantially reduce CPU time in multipr ocessors, they are less effective in removing memory stall time. Consequent ly, despite the inherent latency tolerance features of ILP processors, we f ind memory system performance to be a larger bottleneck and parallel effici encies to be generally poorer in ILP-based multiprocessors than in previous generation multiprocessors. The main reasons for these deficiencies are in sufficient opportunities in the applications to overlap multiple load misse s and increased contention for resources in the system. We also find that s oftware prefetching does not change the memory bound nature of most of our applications on our ILP multiprocessor, mainly due to a large number of lat e prefetches and resource contention. Our results suggest the need for addi tional latency hiding or reducing techniques for ILP systems, such as softw are clustering of load misses and producer-initiated communication.