DETERMINATION OF PROTEIN TERTIARY STRUCTURE CLASS FROM CIRCULAR-DICHROISM SPECTRA

Citation
Sy. Venyaminov et Ks. Vassilenko, DETERMINATION OF PROTEIN TERTIARY STRUCTURE CLASS FROM CIRCULAR-DICHROISM SPECTRA, Analytical biochemistry, 222(1), 1994, pp. 176-184
Citations number
22
Categorie Soggetti
Biology
Journal title
ISSN journal
00032697
Volume
222
Issue
1
Year of publication
1994
Pages
176 - 184
Database
ISI
SICI code
0003-2697(1994)222:1<176:DOPTSC>2.0.ZU;2-4
Abstract
Fifty-three circular dichroism (CD) spectra consisting of the spectra of 46 native proteins, 3 denatured proteins, and one oligopeptide (the spectra of two denatured proteins and oligopeptide were taken at two different temperatures) were investigated in order to examine the corr elation between the shape of the CD spectrum and the tertiary structur e class of the protein. Five classes were considered-all - alpha, all - beta, alpha + beta, alpha/beta, and denatured proteins. Spectra from 190 to 236 nm with 2 nm interval were described as points in 24-dimen sional hyperspace, where coordinates were values of ellipticities at f ixed wavelengths. This allows the spectra to be treated as patterns an d subsequently analyzed using pattern recognition algorithms. Cluster analysis, which does not need predefined information about protein str ucture, divides spectra into several compact groups or clusters with g ood correlation with tertiary structure class. To visualize these resu lts, orthogonalization procedures were imposed on the original data se t in 24-dimensional space. The new 3-dimensional coordinate system dem onstrated well-separated all-beta class and denatured proteins. Region s corresponding to all - alpha and especially alpha + beta and alpha/b eta proteins were not as well resolved. The following approach was the n applied to the original data set to obtain an objective mathematical algorithm for the determination of a protein's tertiary structure cla ss from its CD spectrum. Regions in 24-dimensional hyperspace correspo nding to all of the tertiary structure classes were found by calculati ng the decision functions, or equations of hyperplanes, which separate groups of spectral patterns of different classes. The class represent ing the region which involves the pattern of a protein spectrum can be interpreted as a tertiary structure class of this protein. The accura cy of the method was checked by removing one of the proteins from the training set, finding all the decision functions, and determinating th e class of the excluded protein. This test gives 100% accuracy for all - alpha, alpha/beta, and denatured proteins; 85% for alpha + beta and 75% for all - beta proteins. (C) 1994 Academic Press, Inc.