D. Jouanrimbaud et al., CHARACTERIZATION OF THE REPRESENTATIVITY OF SELECTED SETS OF SAMPLES IN MULTIVARIATE CALIBRATION AND PATTERN-RECOGNITION, Analytica chimica acta, 350(1-2), 1997, pp. 149-161
Whenever samples are extracted from a larger population of samples,
the extracted set should be representative of the original population.
Two statistical tests are proposed to compare two data sets and assess
their representativity. The first compares the variance-covariance
matrices of the two data sets: their equality implies that both data
sets have the same direction in space and that the spread of the data
points around the mean is similar. Then, the Mahalanobis distance
between the centroids of the two sets is calculated to determine
whether the centroids have the same position. The presented results
show that these tests, when applied together, can be used as a
diagnostic for determining representativity.
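
The two checks described above can be sketched in Python with NumPy.
This is an illustrative sketch under stated assumptions, not the
authors' procedure: the abstract does not name the test used for the
variance-covariance matrices, so Box's M statistic (chi-square
approximation) is assumed here; the Mahalanobis distance is assumed to
use the pooled covariance matrix, and its conversion to an F value via
Hotelling's two-sample T2 is likewise an assumption. The function names
box_m_statistic and centroid_mahalanobis are hypothetical, and the two
sets are treated as independent samples.

```python
import numpy as np


def _covariances(a, b):
    """Return both sample covariance matrices and their pooled estimate."""
    n1, n2 = a.shape[0], b.shape[0]
    s1 = np.atleast_2d(np.cov(a, rowvar=False))
    s2 = np.atleast_2d(np.cov(b, rowvar=False))
    pooled = ((n1 - 1) * s1 + (n2 - 1) * s2) / (n1 + n2 - 2)
    return s1, s2, pooled


def box_m_statistic(a, b):
    """Box's M statistic (assumed test) for equal covariance matrices.

    Returns (statistic, df); under equality the statistic is roughly
    chi-square distributed with df degrees of freedom.
    """
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    n1, p = a.shape
    n2 = b.shape[0]
    s1, s2, pooled = _covariances(a, b)

    def logdet(s):
        # slogdet is numerically safer than log(det(s)).
        return np.linalg.slogdet(s)[1]

    m = ((n1 + n2 - 2) * logdet(pooled)
         - (n1 - 1) * logdet(s1)
         - (n2 - 1) * logdet(s2))
    # Box's correction factor for two groups.
    c = ((2 * p**2 + 3 * p - 1) / (6 * (p + 1))
         * (1 / (n1 - 1) + 1 / (n2 - 1) - 1 / (n1 + n2 - 2)))
    df = p * (p + 1) // 2
    return (1 - c) * m, df


def centroid_mahalanobis(a, b):
    """Squared Mahalanobis distance between the two centroids.

    Returns (d2, f_value, (df1, df2)), where f_value is the Hotelling
    two-sample T2 statistic rescaled to an F statistic (assumed step).
    """
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    n1, p = a.shape
    n2 = b.shape[0]
    _, _, pooled = _covariances(a, b)
    diff = a.mean(axis=0) - b.mean(axis=0)
    d2 = float(diff @ np.linalg.solve(pooled, diff))
    t2 = n1 * n2 / (n1 + n2) * d2
    f_value = (n1 + n2 - p - 1) / (p * (n1 + n2 - 2)) * t2
    return d2, f_value, (p, n1 + n2 - p - 1)


# Synthetic example data (not from the paper).
rng = np.random.default_rng(0)
population = rng.normal(size=(200, 3))
subset = population[::2]  # every second sample as the "selected" set

stat, df = box_m_statistic(population, subset)
d2, f_value, f_df = centroid_mahalanobis(population, subset)
```

A large Box's M statistic relative to the chi-square distribution with
df degrees of freedom would suggest different spread or direction; a
large F value would suggest displaced centroids. If SciPy is available,
scipy.stats.chi2.sf and scipy.stats.f.sf can turn these into p-values.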