Knowledge-based analysis of microarray gene expression data by using support vector machines

Citation
Mps. Brown et al., Knowledge-based analysis of microarray gene expression data by using support vector machines, P NAS US, 97(1), 2000, pp. 262-267
Citations number
32
Categorie Soggetti
Multidisciplinary
Journal title
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
ISSN journal
00278424 → ACNP
Volume
97
Issue
1
Year of publication
2000
Pages
262 - 267
Database
ISI
SICI code
0027-8424(20000104)97:1<262:KAOMGE>2.0.ZU;2-S
Abstract
We introduce a method of functionally classifying genes by using gene expre ssion data from DNA microarray hybridization experiments. The method is bas ed on the theory of support vector machines (SVMs). SVMs are considered a s upervised computer learning method because they exploit prior knowledge of gene function to identify unknown genes of similar function from expression data. SVMs avoid several problems associated with unsupervised clustering methods, such as hierarchical clustering and self-organizing maps. SVMs hav e many mathematical features that make them attractive for gene expression analysis, including their flexibility in choosing a similarity function, sp arseness of solution when dealing with large data sets, the ability to hand le large feature spaces, and the ability to identify outliers. We test seve ral SVMs that use different similarity metrics, as well as some other super vised learning methods, and find that the SVMs best identify sets of genes with a common function using expression data. Finally, we use SVMs to predi ct functional roles for uncharacterized yeast ORFs based on their expressio n data.