High bandwidth on-chip cache design

Citation
Km. Wilson et K. Olukotun, High bandwidth on-chip cache design, IEEE COMPUT, 50(4), 2001, pp. 292-307
Citations number
28
Categorie Soggetti
Computer Science & Engineering
Journal title
IEEE TRANSACTIONS ON COMPUTERS
ISSN journal
00189340 → ACNP
Volume
50
Issue
4
Year of publication
2001
Pages
292 - 307
Database
ISI
SICI code
0018-9340(200104)50:4<292:HBOCD>2.0.ZU;2-G
Abstract
In this paper, we evaluate the performance of high bandwidth cache organiza tions employing multiple cache ports, multiple cycle hit times, and cache p ort efficiency enhancements, such as load all and line buffer, to find the organization that provides the best processor performance. Using a dynamic superscalar processor running realistic benchmarks that include operating s ystem references, we use execution time to measure processor performance. W hen the cache is limited to a single cache port without enhancements, we fi nd that two cache ports increase processor performance by 25 percent. With the addition of line buffer and load all to a single ported cache, the proc essor achieves 91 percent of the performance of the same processor containi ng a cache with two ports. When the processor is not limited to a single ca che port, the results show that a large dual-ported multicycle pipelined SR AM cache with a line buffer maximizes processor performance. A targe pipeli ned cache provides both a low miss rate and a high CPU clock frequency. Dua l-porting the cache and using a line buffer provide the bandwidth needed by a dynamic superscalar processor. The line buffer makes the pipelined dual- ported cache the best option by increasing cache port bandwidth and hiding cache latency.