A HEURISTIC STORAGE FOR MINIMIZING ACCESS TIME OF ARBITRARY DATA PATTERNS

Citation
Ma. Almouhamed et Ss. Seiden, A HEURISTIC STORAGE FOR MINIMIZING ACCESS TIME OF ARBITRARY DATA PATTERNS, IEEE transactions on parallel and distributed systems, 8(4), 1997, pp. 441-447
Citations number
12
Categorie Soggetti
System Science","Engineering, Eletrical & Electronic","Computer Science Theory & Methods
ISSN journal
10459219
Volume
8
Issue
4
Year of publication
1997
Pages
441 - 447
Database
ISI
SICI code
1045-9219(1997)8:4<441:AHSFMA>2.0.ZU;2-2
Abstract
The serialization of memory accesses is a major limiting factor in hig h performance SIMD computers. The data patterns or templates that are accessed by a program can be perceived by the compiler, and, therefore , the design of dynamic storage schemes that minimize conflicts may dr amatically improve performance. The problem of finding storage schemes that minimize the access time of arbitrary sets of power-of-two data patterns is proved to be NP-complete. We propose linear address transf ormations that can be dynamically applied by each processing element f or mapping array references onto memories. An efficient approach for c ombining the constraints of different access patterns into one single linear address transformation is presented. We prove that finding the transformation that minimizes the access time is reducible to N-colori ng, where N is the number of parallel memories. Using coloring heurist ics, storage schemes are investigated with respect to minimizing the i mplementation cost (perfect storage) and overall access conflicts (sem iperfect storage). Results show that the perfect-storage may deviate o n the average by 20% from the optimum access time in the case of 10 ar bitrary data patterns and 16 memories. However, semiperfect schemes le ad to dramatic reduction of the degree of conflict compared to perfect -schemes. The proposed heuristic storage largely outperforms interleav ing and row-column-diagonals storages. The method can be implemented a s compiler procedure for synthesizing storage schemes that promote par allel access to arbitrary sets of data patterns.