The identification of influential subsets in regression using a fuzzy clustering strategy

Citation
B. Seaver et al., The identification of influential subsets in regression using a fuzzy clustering strategy, TECHNOMET, 41(4), 1999, pp. 340-351
Citations number
42
Categorie Soggetti
Mathematics
Journal title
TECHNOMETRICS
ISSN journal
00401706 → ACNP
Volume
41
Issue
4
Year of publication
1999
Pages
340 - 351
Database
ISI
SICI code
0040-1706(199911)41:4<340:TIOISI>2.0.ZU;2-Z
Abstract
A fuzzy clustering strategy is used to identify subsets of influential obse rvations in regression, As part of the fuzzy clustering strategy, the analy st can explore the uniqueness of selected subsets and the degree of belongi ng of observations to selected subsets. This is accomplished by either vary ing a fuzzy parameter or the number of clusters. Once the observations or s ubsets have been identified, the analyst can then compute regression diagno stics to confirm their degree of influence in regression. Bootstrapping and high-breakdown procedures were used to confirm the influence of the previo usly identified subsets. This fuzzy clustering strategy is applied to the m odified data on wood-specific gravity and an augmented production dataset. Both datasets have been previously presented in the literature.