A high-bandwidth memory pipeline for wide issue processors

Citation
S. Cho et al., A high-bandwidth memory pipeline for wide issue processors, IEEE COMPUT, 50(7), 2001, pp. 709-723
Citations number
37
Categorie Soggetti
Computer Science & Engineering
Journal title
IEEE TRANSACTIONS ON COMPUTERS
ISSN journal
00189340 → ACNP
Volume
50
Issue
7
Year of publication
2001
Pages
709 - 723
Database
ISI
SICI code
0018-9340(200107)50:7<709:AHMPFW>2.0.ZU;2-Z
Abstract
Providing adequate data bandwidth is extremely important for a future wide- issue processor to achieve its full performance potential. Adding a large n umber of ports to a data cache, however, becomes increasingly inefficient a nd can 'add to the hardware complexity significantly. This paper takes an a lternative or complementary approach for providing more data bandwidth, cal led data decoupling. This paper especially studies an interesting, yet less explored, behavior of memory access instructions, called access region loc ality, which is concerned with each static memory instruction and its range of access locations at runtime. Our experimental study using a set of SPEC 95 benchmark programs shows that most memory access instructions reference a single region at runtime. Also shown is that it is possible to accurately predict the access region of a memory instruction at runtime by scrutinizi ng the addressing mode of the instruction and the past access history of it . We describe and evaluate a wide-issue superscalar processor with two dist inct sets of memory pipelines and caches, driven by the access region predi ctor. Experimental results indicate that the proposed mechanism is very eff ective in providing high memory bandwidth to the processor, resulting in co mparable or better performance than a conventional memory design with a hea vily multiported data cache that can lead to much higher hardware complexit y.