ITA
ENG

SCALABLE LOAD BALANCING STRATEGIES FOR PARALLEL A-ASTERISK ALGORITHMS

Authors

DUTT S MAHAPATRA NR

Citation

S. Dutt et Nr. Mahapatra, SCALABLE LOAD BALANCING STRATEGIES FOR PARALLEL A-ASTERISK ALGORITHMS, Journal of parallel and distributed computing, 22(3), 1994, pp. 488-505

Citations number

Categorie Soggetti

Computer Sciences","Computer Science Theory & Methods

Journal title

Journal of parallel and distributed computing → ACNP

ISSN journal

07437315

Volume

Issue

Year of publication

1994

Pages

488 - 505

Database

ISI

SICI code

0743-7315(1994)22:3<488:SLBSFP>2.0.ZU;2-Y

Abstract

In this paper, we develop load balancing strategies for scalable high- performance parallel A algorithms suitable for distributed-memory mac hines. In parallel A search, inefficiencies such as processor starvat ion and search of nonessential spaces (search spaces not explored by t he sequential algorithm) grow with the number of processors P used, th us restricting its scalability. To alleviate this effect, we propose a novel parallel startup phase and an efficient dynamic load balancing strategy called the quality equalizing (QE) strategy. Our new parallel startup scheme executes optimally in THETA(log P) time and, in additi on, achieves good initial load balance. The QE strategy prossess certa in unique quantitative and qualitative load balancing properties that enable it to significantly reduce starvation and nonessential work. Co nsequently, we obtain a highly scalable parallel A algorithm with an almost-linear speedup. The startup and load balancing schemes were emp loyed in parallel A algorithms to solve the Traveling Salesman Proble m on an nCUBE2 hypercube multicomputer. The QE strategy yields average speedup improvements of about 20-185% and 15-120% at low and intermed iate work densities (the ratio of the problem size to P), respectively , over three well-known load balancing methods-the round-robin (RR), t he random communication (RC), and the neighborhood averaging (NA) stra tegies. The average speedup observed on 1024 processors is about 985, representing a very high efficiency of 0.96. Finally, we analyze and e mpirically evaluate the scalability of parallel A algorithms in terms of the isoefficiency metric. Our analysis gives (1) a THETA(P log P) lower bound on the isoefficiency function of any parallel A algorithm , and (2) a general expression for the upper bound on the isoefficienc y function of our parallel A algorithm using the QE strategy on any t opology-for the hypercube and 2-D mesh architectures the upper bounds on the isoefficiency function are found to be THETA(P log2 P) and THET A(P square-root P), respectively. Experimental results validate our an alysis, and also show that parallel A search has better scalability u sing the QE load balancing strategy than using the RR, RC, or NA strat egies. (C) 1994 Academic Press, Inc.