A MULTISEED NON-HIERARCHICAL CLUSTERING TECHNIQUE FOR DATA-ANALYSIS

Citation
D. Chaudhuri et Bb. Chaudhuri, A MULTISEED NON-HIERARCHICAL CLUSTERING TECHNIQUE FOR DATA-ANALYSIS, International Journal of Systems Science, 26(2), 1995, pp. 375-385
Citations number
12
Categorie Soggetti
System Science","Computer Science Theory & Methods","Operatione Research & Management Science
ISSN journal
00207721
Volume
26
Issue
2
Year of publication
1995
Pages
375 - 385
Database
ISI
SICI code
0020-7721(1995)26:2<375:AMNCTF>2.0.ZU;2-K
Abstract
Clustering methods such as K-means and its variations, such as Forgy, as well as their improved version ISODATA, do not work well if the sha pe of the cluster is elongated. It is pointed out that a single seed p oint cannot correctly reflect the nature of the data of an elongated c luster. A multiseed clustering algorithm is proposed, where one cluste r may contain more than one seed point. A density-based algorithm is u sed to choose the initial seed points. To assign several seed points t o one cluster, a minimal spanning tree guided novel merging technique is proposed. The merging technique is quite general and may be applied to other clustering approaches as well. Experimental results are pres ented to demonstrate the efficiency of this clustering procedure.