We present two new sets of energy functions for protein structure reco
gnition, given the primary sequence of amino acids along the polypepti
de chain, The first set of potentials is based on the positions of alp
ha- and the second on positions of beta- and alpha-carbon atoms of ami
no acid residues, The potentials are derived using a theory of Boltzma
nn-like statistics of protein structure, The energy terms incorporate
both long-range interactions between residues remote along a chain and
short-range interactions between near neighbors, Distance dependence
is approximated by a piecewise constant function defined on intervals
of equal size, The size of the interval is optimized to preserve as mu
ch detail as possible without introducing excessive error due to limit
ed statistics, A database of 214 non-homologous proteins was used both
for the derivation of the potentials, and for the 'threading' test or
iginally suggested by Hendlich et al. (1990) J. Mol. Biol., 216, 167-1
80. Special care is taken to avoid systematic error in this test, For
threading, we used 100 non-homologous protein chains of 60-205 residue
s, The energy of each of the native structures was compared with the e
nergy of 43000 to 19000 alternative structures generated by threading,
Of these 100 native structures, 92 have the lowest energy with alpha-
carbon-based potentials and, even more, 98 of these 100 structures, ha
ve the lowest energy with the beta- and alpha-carbon based potentials.