ITA
ENG

WEIGHTING AND SELECTION OF VARIABLES FOR CLUSTER-ANALYSIS

Authors

GNANADESIKAN R KETTENRING JR TSAO SL

Citation

R. Gnanadesikan et al., WEIGHTING AND SELECTION OF VARIABLES FOR CLUSTER-ANALYSIS, Journal of classification, 12(1), 1995, pp. 113-136

Citations number

Categorie Soggetti

Social Sciences, Mathematical Methods","Mathematical, Methods, Social Sciences

Journal title

Journal of classification → ACNP

ISSN journal

01764268

Volume

Issue

Year of publication

1995

Pages

113 - 136

Database

ISI

SICI code

0176-4268(1995)12:1<113:WASOVF>2.0.ZU;2-R

Abstract

One of the thorniest aspects of cluster analysis continues to be the w eighting and selection of variables. This paper reports on the perform ance of nine methods on eight ''leading case'' simulated and real sets of data. The results demonstrate shortcomings of weighting based on t he standard deviation or range as well as other more complex schemes i n the literature. Weighting schemes based upon carefully chosen estima tes of within-cluster and between-cluster variability are generally mo re effective. These estimates do not require knowledge of the cluster structure. Additional research is essential: worry-free approaches do not yet exist.