WEIGHTING AND SELECTION OF VARIABLES FOR CLUSTER-ANALYSIS

Citation
R. Gnanadesikan et al., WEIGHTING AND SELECTION OF VARIABLES FOR CLUSTER-ANALYSIS, Journal of classification, 12(1), 1995, pp. 113-136
Citations number
32
Categorie Soggetti
Social Sciences, Mathematical Methods","Mathematical, Methods, Social Sciences
Journal title
ISSN journal
01764268
Volume
12
Issue
1
Year of publication
1995
Pages
113 - 136
Database
ISI
SICI code
0176-4268(1995)12:1<113:WASOVF>2.0.ZU;2-R
Abstract
One of the thorniest aspects of cluster analysis continues to be the w eighting and selection of variables. This paper reports on the perform ance of nine methods on eight ''leading case'' simulated and real sets of data. The results demonstrate shortcomings of weighting based on t he standard deviation or range as well as other more complex schemes i n the literature. Weighting schemes based upon carefully chosen estima tes of within-cluster and between-cluster variability are generally mo re effective. These estimates do not require knowledge of the cluster structure. Additional research is essential: worry-free approaches do not yet exist.