We present a parallel algorithm for the QR factorization of a dense
matrix without column pivoting on a message passing multiprocessor. The
algorithm combines the numerical efficiency of Householder reflections
with the excellent communication properties of the torus-wrap mapping.
Analytical results indicate that the communication overhead for this
algorithm is less than that for other common approaches. Numerical
results on an nCUBE 2 confirm the efficiency of our technique.
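
For orientation, the two ingredients named above can be sketched serially. This is a minimal illustration under stated assumptions, not the parallel algorithm itself: the function names `torus_wrap_owner` and `householder_qr_r` are hypothetical, the mapping is assumed to be the usual wrapped assignment in which entry (i, j) belongs to processor (i mod P_r, j mod P_c) on a P_r by P_c grid, and the factorization runs on one process with no message passing.

```python
import math

def torus_wrap_owner(i, j, proc_rows, proc_cols):
    """Grid coordinates of the processor owning entry (i, j).

    Assumption: a wrapped (cyclic) assignment in both dimensions.
    """
    return (i % proc_rows, j % proc_cols)

def householder_qr_r(a):
    """Return the upper-triangular factor R of a (a list of rows).

    Serial Householder QR without column pivoting; Q is not formed.
    """
    m, n = len(a), len(a[0])
    r = [row[:] for row in a]  # work on a copy, leave the input intact
    for k in range(min(m - 1, n)):
        # Column k, rows k..m-1, is the vector to be reflected.
        x = [r[i][k] for i in range(k, m)]
        norm_x = math.sqrt(sum(t * t for t in x))
        if norm_x == 0.0:
            continue  # nothing to eliminate in this column
        # Sign chosen opposite to x[0] to avoid cancellation in v[0].
        alpha = -math.copysign(norm_x, x[0])
        v = x[:]
        v[0] -= alpha
        vtv = sum(t * t for t in v)
        # Apply H = I - 2 v v^T / (v^T v) to columns k..n-1.
        for j in range(k, n):
            dot = sum(v[i] * r[k + i][j] for i in range(len(v)))
            scale = 2.0 * dot / vtv
            for i in range(len(v)):
                r[k + i][j] -= scale * v[i]
    return r
```

In a distributed setting each processor would hold only the entries that `torus_wrap_owner` assigns to it, so the dot products and updates in the inner loops become the steps that require communication along processor rows and columns.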