AMINO-ACID DOUBLETS AND TRIPLETS IN PROTEIN SEQUENCES - A DATABASE ANALYSIS

Citation
K. Veluraja et Sa. Mugilan, AMINO-ACID DOUBLETS AND TRIPLETS IN PROTEIN SEQUENCES - A DATABASE ANALYSIS, Current Science, 72(8), 1997, pp. 572-577
Citations number
11
Categorie Soggetti
Multidisciplinary Sciences
Journal title
ISSN journal
00113891
Volume
72
Issue
8
Year of publication
1997
Pages
572 - 577
Database
ISI
SICI code
0011-3891(1997)72:8<572:ADATIP>2.0.ZU;2-U
Abstract
Sequence analysis on the aspect of amino acid doublet and triplet repe ats in 36,000 proteins comprising a total of 1,24,96,420 amino acid en tries was carried out to study the structural aspects of proteins. Any deviation (normalized) of the doublet and triplet occurrence from the ir anticipated occurrence based on the frequency of the individual ami no acid entries was attributed either to preferential selection (+ve d eviation) or to non preferential selection (-ve deviation). All the ho modoublets and homotriplets, except Ile-Ile-Ile, show positive deviati on, implying the preferential selection for homomers. Most of the hete rodoublets show marginal deviation indicating the expected distributio n. A few doublets do show high positive and high negative deviations, containing the less propensity amino acids in the repeat. Most of the triplets which contain two identical amino acids in the repeat show po sitive deviations. In this category too, the high positively deviated triplets mainly contain less propensity amino acids, substantiating th e economical usage of less propensity amino acids. A few triplets whic h contain two proline residues have large difference in observed and e xpected frequencies, in addition to high positive deviation indicating important structural role of proline residues. Most of the heterotrip lets which have three different amino acid entries in the repeat show marginal deviations. The triplets with large deviation in this categor y also are dominated by less propensity amino acids. High frequency do ublet pairs containing high propensity amino acid in the repeat (Glu-L ys, Lys-Glu), contribute structurally to alpha-helical conformation.