R.W.W. Hooft et al., THE PDBFINDER DATABASE - A SUMMARY OF PDB, DSSP AND HSSP INFORMATION WITH ADDED VALUE, Computer Applications in the Biosciences, 12(6), 1996, pp. 525-529
Motivation: The Protein Data Bank currently contains more than 4700 protein coordinate sets. It is often desirable to make a selection from these files based on a criterion like R-factor, experimental method, length of the amino acid sequence, or the number of homologous sequences in SWISSPROT. Doing this using the distributed form of the Protein Data Bank can be a tedious task, because (1) this requires reading one file for every single entry, and (2) not all of the information is present in a consistent computer-readable way in all of the entries.
Results: The PDBFINDER database provides an easy-to-interpret file containing summary information about all Protein Data Bank files. Summary information from the DSSP (Definition of Secondary Structure of Proteins) and HSSP (Homology-derived Secondary Structure of Proteins) databases is also included. Furthermore, where essential data were missing from the Protein Data Bank file, this information has been retrieved from the original literature.
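
Example: the kind of selection described under Motivation can be sketched against a single summary file. This is a minimal illustration only. The abstract does not specify the file layout, so the `Key : value` lines, the `//` entry terminator, the field names (`ID`, `Exp-Method`, `R-Factor`) and the sample entries below are all assumptions made up for this sketch, not the documented PDBFINDER format.

```python
from typing import Dict, Iterable, Iterator, List

# Invented sample in an ASSUMED "Key : value" layout with "//" ending
# each entry; the identifiers and values are placeholders, not real data.
SAMPLE = """\
ID : XXX1
Exp-Method : X
R-Factor : 0.18
//
ID : XXX2
Exp-Method : NMR
//
ID : XXX3
Exp-Method : X
R-Factor : 0.25
//
"""


def parse_records(text: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per entry from the assumed flat-file layout."""
    record: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if line == "//":
            # Entry terminator: emit what was collected so far.
            if record:
                yield record
            record = {}
        elif ":" in line:
            key, _, value = line.partition(":")
            record[key.strip()] = value.strip()
    if record:
        # Tolerate a final entry that lacks a terminator.
        yield record


def select(records: Iterable[Dict[str, str]],
           max_r_factor: float, method: str) -> List[str]:
    """Return IDs of entries with the given method and R-factor <= limit."""
    chosen: List[str] = []
    for rec in records:
        try:
            r_factor = float(rec["R-Factor"])
        except (KeyError, ValueError):
            # Entries without a usable R-factor are skipped.
            continue
        if r_factor <= max_r_factor and rec.get("Exp-Method") == method:
            chosen.append(rec.get("ID", "?"))
    return chosen


if __name__ == "__main__":
    print(select(parse_records(SAMPLE), max_r_factor=0.20, method="X"))
```

Because every entry sits in one file with uniform keys, the selection is a single pass over the text rather than one file read per entry, which is the convenience the abstract describes.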