DATA-BANK HOMOLOGY SEARCH ALGORITHM WITH LINEAR COMPUTATION COMPLEXITY

Citation
Vb. Strelets et al., DATA-BANK HOMOLOGY SEARCH ALGORITHM WITH LINEAR COMPUTATION COMPLEXITY, Computer applications in the biosciences, 10(3), 1994, pp. 319-322
Citations number
15
Categorie Soggetti
Mathematical Methods, Biology & Medicine","Computer Sciences, Special Topics","Computer Science Interdisciplinary Applications","Biology Miscellaneous
ISSN journal
02667061
Volume
10
Issue
3
Year of publication
1994
Pages
319 - 322
Database
ISI
SICI code
0266-7061(1994)10:3<319:DHSAWL>2.0.ZU;2-4
Abstract
A new algorithm for data bank homology search is proposed. The princip al advantages of the new algorithm are: (i) linear computation complex ity, (ii) law memory requirements; and (iii) high sensitivity to the p resence of local legion homology. The algorithm first calculates indic ative matrices of k-tuple 'realization' in the query sequence and then searches for an appropriate number of matching k-tuples within a narr ow range in database sequences. It does not require k-tuple coordinate s fabulation and in-memory placement for database sequences. The algor ithm is implemented in a program for execution on PC-compatible comput ers and tested on PIR and GenBank databases with good results. A few m odifications designed to improve the selectivity are also discussed. A s an application example, the search for homology of the mouse homeoti c protein HOX 3.1 is given.