The Wilcoxon-Mann-Whitney test enjoys great popularity among scientists com
paring two groups of observations, especially when measurements made on a c
ontinuous scale are non-normally distributed. Triggered by different result
s for the procedure from two statistics programs, we compared the outcomes
from 11 PC-based statistics packages. The findings were that the delivered
p values ranged from significant to nonsignificant at the 5% level, dependi
ng on whether a large-sample approximation or an exact permutation form of
the test was used and, in the former case, whether or not a correction for
continuity was used and whether or not a correction for ties was made. Some
packages also produced pseudo-exact p values, based on the null distributi
on under the assumption of no ties. A further crucial point is that the var
iant of the algorithm used for computation by the packages is rarely indica
ted in the output or documented in the Help facility and the manuals. We co
nclude that the only accurate form of the Wilcoxon-Mann-Whitney procedure i
s one in which the exact permutation null distribution is compiled for the
actual data.