Dimensionality reduction in unsupervised learning of conditional Gaussian networks

Citation
Jm. Pena et al., Dimensionality reduction in unsupervised learning of conditional Gaussian networks, IEEE PATT A, 23(6), 2001, pp. 590-603
Citations number
42
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
ISSN journal
01628828 → ACNP
Volume
23
Issue
6
Year of publication
2001
Pages
590 - 603
Database
ISI
SICI code
0162-8828(200106)23:6<590:DRIULO>2.0.ZU;2-9
Abstract
This paper introduces a novel enhancement for unsupervised learning of cond itional Gaussian networks that benefits from feature selection. Our proposa l is based on the assumption that, in the absence of labels reflecting the cluster membership of each case of the database, those features that exhibi t low correlation with the rest of the features can be considered irrelevan t for the learning process. Thus, we suggest performing this process using only the relevant features. Then, every irrelevant feature is added to the learned model to obtain an explanatory model for the original database whic h is our primary goal. A simple and, thus, efficient measure to assess the relevance of the features for the learning process is presented. Additional ly, the form of this measure allows us to calculate a relevance threshold t o automatically identify the relevant features. The experimental results re ported for synthetic and real-world databases show the ability of our propo sal to distinguish between relevant and irrelevant features and to accelera te learning; however, still obtaining good explanatory models for the origi nal database.