ITA
ENG

Relationships between the definition of the hyperplane width to the fidelity of principal component loading patterns

Authors

Richman, MB Gong, XF

Citation

Mb. Richman et Xf. Gong, Relationships between the definition of the hyperplane width to the fidelity of principal component loading patterns, J CLIMATE, 12(6), 1999, pp. 1557-1576

Citations number

Categorie Soggetti

Earth Sciences

Journal title

JOURNAL OF CLIMATE

ISSN journal

08948755 → ACNP

Volume

Issue

Year of publication

1999

Pages

1557 - 1576

Database

ISI

SICI code

0894-8755(199906)12:6<1557:RBTDOT>2.0.ZU;2-Y

Abstract

When applying eigenanalysis, one decision analysts make is the determinatio n of what magnitude an eigen vector coefficient (e.g., principal component (PC) loading) must achieve to be considered as physically important. Such c oefficients can be displayed on maps or in a time series or tables to gain a fuller understanding of a large array of multivariate data. Previously, s uch a decision on what value of loading designates a useful signal (hereaft er called the loading "cutoff") for each eigenvector has been purely subjec tive. The importance of selecting such a cutoff is apparent since those loa ding elements in the range of zero to the cutoff are ignored in the interpr etation and naming of PCs since only the absolute values of loadings greate r than the cutoff are physically analyzed. This research sets out to object ify the problem of best identifying the cutoff by application of matching b etween known correlation/covariance structures and their corresponding eige npatterns, as this cutoff point (known as the hyperplane width) is varied. A Monte Carlo framework is used to resample at five sample sizes. Fourteen different hyperplane cutoff widths are tested, bootstrap resampled 50 times to obtain stable results. The key findings are that the location of an opt imal hyperplane cutoff width (one which maximized the information content m atch between the eigenvector and the parent dispersion matrix from which it was derived) is a well-behaved unimodal function. On an individual eigenve ctor. this enables the unique determination of a hyperplane cutoff value to be used to separate those loadings that best reflect the relationships fro m those that do not. The effects of sample size on the matching accuracy ar e dramatic as the values for all solutions (i.e., unrotated, rotated) rose steadily from 25 through 250 observations and then weakly thereafter. The s pecific matching coefficients are useful to assess the penalties incurred w hen one analyzes eigenvector coefficients of a lower absolute value than th e cutoff (termed coefficient in the hyperplane) or, alternatively, chooses not to analyze coefficients that contain useful physical signal outside of the hyperplane. Therefore, this study enables the analyst to make the best use of the information available in their PCs to shed light on complicated data structures.