On clustering validation techniques

Citation
M. Halkidi et al., On clustering validation techniques, J INTELL IN, 17(2-3), 2001, pp. 107-145
Citations number
32
Categorie Soggetti
Information Tecnology & Communication Systems
Journal title
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS
ISSN journal
09259902 → ACNP
Volume
17
Issue
2-3
Year of publication
2001
Pages
107 - 145
Database
ISI
SICI code
0925-9902(2001)17:2-3<107:OCVT>2.0.ZU;2-2
Abstract
Cluster analysis aims at identifying groups of similar objects and, therefo re helps to discover distribution of patterns and interesting correlations in large data sets. It has been subject of wide research since it arises in many application domains in engineering, business and social sciences. Esp ecially, in the last years the availability of huge transactional and exper imental data sets and the arising requirements for data mining created need s for clustering algorithms that scale and can be applied in diverse domain s. This paper introduces the fundamental concepts of clustering while it surve ys the widely known clustering algorithms in a comparative way. Moreover, i t addresses an important issue of clustering process regarding the quality assessment of the clustering results. This is also related to the inherent features of the data set under concern. A review of clustering validity mea sures and approaches available in the literature is presented. Furthermore, the paper illustrates the issues that are under-addressed by the recent al gorithms and gives the trends in clustering process.