Cyclic shifts are intrinsic operations in many parallel algorithms. Th
erefore, it is important to execute them efficiently. In this note, we
present and analyze an algorithm for the cyclic shift operation on n-
dimensional (distributed memory) hypercubes. On asynchronous hypercube
s, we have shown that all previously known algorithms for cyclic shift
s need local message buffers. In order to overcome this, we present an
algorithm that always uses link-disjoint paths for routing. We prove
that by using this algorithm, any cyclic shift can be realized by usin
g at most 4/3 n steps, without using any local message buffers. (C) 19
96 Academic Press, Inc.