A NEW METHOD OF DETECTING CLUSTERING IN THE DATA

Citation
K. Szczubialka et al., A NEW METHOD OF DETECTING CLUSTERING IN THE DATA, Chemometrics and intelligent laboratory systems, 41(2), 1998, pp. 145-160
Citations number
13
Categorie Soggetti
Computer Science Artificial Intelligence","Robotics & Automatic Control","Instument & Instrumentation","Chemistry Analytical","Computer Science Artificial Intelligence","Robotics & Automatic Control
ISSN journal
01697439
Volume
41
Issue
2
Year of publication
1998
Pages
145 - 160
Database
ISI
SICI code
0169-7439(1998)41:2<145:ANMODC>2.0.ZU;2-3
Abstract
The present study describes a new method of detecting clustering in a data set. For each object in the data, the distances to all other obje cts are calculated, sorted in ascending order, normalized and plotted as so called distance curves. n curves are obtained for data containin g n objects. The shape of these curves, together with their distributi on, give information on clustering of the data and possible distributi on. It is also possible to evaluate the populations of the clusters. T he method, however, fails for very close clusters and for elongated cl usters whose separation distance is much smaller than the range of the ir greatest variability. The method is explained using simulated and r eal data. (C) 1998 Elsevier Science B.V. All rights reserved.