A NOVEL MULTISEED NONHIERARCHICAL DATA CLUSTERING TECHNIQUE

Citation
D. Chaudhuri et Bb. Chaudhuri, A NOVEL MULTISEED NONHIERARCHICAL DATA CLUSTERING TECHNIQUE, IEEE transactions on systems, man and cybernetics. Part B. Cybernetics, 27(5), 1997, pp. 871-877
Citations number
23
Categorie Soggetti
Controlo Theory & Cybernetics","Computer Science Cybernetics","Robotics & Automatic Control
ISSN journal
10834419
Volume
27
Issue
5
Year of publication
1997
Pages
871 - 877
Database
ISI
SICI code
1083-4419(1997)27:5<871:ANMNDC>2.0.ZU;2-R
Abstract
Clustering techniques such as K-means and Forgy as well as their impro ved version ISODATA group data around one seed point for each cluster. It is well known that these methods do not work well if the shape of the cluster is elongated or nonconvex. We argue that for a elongated o r nonconvex shaped cluster, more than one seed is needed, In this pape r a multiseed clustering algorithm is proposed. A density based repres entative point selection algorithm is used to choose the initial seed points. To assign several seed points to one cluster, a minimal spanni ng tree guided novel technique is proposed. Also, a border point detec tion algorithm is proposed for the detection of shape of the cluster. This border in turn signifies whether the cluster is elongated or not. Experimental results show the efficiency of this clustering technique .