ITA
ENG

Mass spectrometry allows direct identification of proteins in large genomes

Authors

Kuster, B Mortensen, P Andersen, JS Mann, M

Citation

B. Kuster et al., Mass spectrometry allows direct identification of proteins in large genomes, PROTEOMICS, 1(5), 2001, pp. 641-650

Citations number

Categorie Soggetti

Chemistry & Analysis

Journal title

PROTEOMICS

ISSN journal

16159853 → ACNP

Volume

Issue

Year of publication

2001

Pages

641 - 650

Database

ISI

SICI code

1615-9853(200105)1:5<641:MSADIO>2.0.ZU;2-H

Abstract

Proteome projects seek to provide systematic functional analysis of the gen es uncovered by genome sequencing initiatives. Mass spectrometric protein i dentification is a key requirement in these studies but to date, database s earching tools rely on the availability of protein sequences derived from f ull length cDNA, expressed sequence tags or predicted open reading frames ( ORFs) from genomic sequences. We demonstrate here that proteins can be iden tified directly in large genomic databases using peptide sequence tags obta ined by tandem mass spectrometry. On the background of vast amounts of nonc oding DNA sequence, identified peptides localize coding sequences (exons) i n a confined region of the genome, which contains the cognate gene. The app roach does not require prior information about putative ORFs as predicted b y computerized gene finding algorithms. The method scales to the complete h uman genome and allows identification, mapping, cloning and assistance in g ene prediction of any protein for which minimal mass spectrometric informat ion can be obtained. Several novel proteins from Arabidopsis thaliana and h uman have been discovered in this way.