Simultaneous variable selection and class fusion for high-dimensional linear discriminant analysis

Authors
Citation
Guo, Jian, Simultaneous variable selection and class fusion for high-dimensional linear discriminant analysis, Biostatistics (Oxford. Print) , 11(4), 2010, pp. 599-608
ISSN journal
14654644
Volume
11
Issue
4
Year of publication
2010
Pages
599 - 608
Database
ACNP
SICI code
Abstract
In many high-dimensional microarray classification problems, an important task is to identify subsets of genes that best discriminate the classes.Nevertheless, existing gene selection methods for microarray classification cannot identify which classes are discriminable by these selected genes.In this paper, we propose an improved linear discriminant analysis (LDA) method that simultaneously selects important genes and identifies the discriminable classes.Specifically, a pairwise fusion penalty for LDA was used to shrink the differences of the class centroids in pairs for each variable and fuse the centroids of indiscriminable classes altogether.The numerical results in analyzing 2 gene expression profiles demonstrate the proposed approach help improve the interpretation of important genes in microarray classification problems.