DISTRIBUTED LOAD BALANCING FOR PARALLEL MAIN MEMORY HASH JOIN

Citation
Wr. Tout et S. Pramanik, DISTRIBUTED LOAD BALANCING FOR PARALLEL MAIN MEMORY HASH JOIN, IEEE transactions on parallel and distributed systems, 6(8), 1995, pp. 841-849
Citations number
19
Categorie Soggetti
System Science","Engineering, Eletrical & Electronic","Computer Science Theory & Methods
ISSN journal
10459219
Volume
6
Issue
8
Year of publication
1995
Pages
841 - 849
Database
ISI
SICI code
1045-9219(1995)6:8<841:DLBFPM>2.0.ZU;2-Z
Abstract
Parallel joins have been widely studied during the past decade and a n umber of efficient algorithms were presented. While it is known that t he performance of these algorithms may suffer greatly in the presence of skewed input data, the work on load balancing schemes for parallel join has been limited. The main contribution of this paper is the deve lopment and analysis of a new distributed data structure and an effect ive load balancing scheme for parallel main memory hash join on NUMA a rchitecture. Multiprocessors based on this architecture are scalable i n both size of main memory and number of processors, and provide very high memory bandwidth. The load balancing scheme is based on random pr obing to avoid the hot spot problems caused by probing sequentially. W e have modeled this load balancing scheme both analytically and experi mentally. The experiments were run on a BBN TC2000 multiprocessor syst em.