ITA
ENG

A new approach to sequence comparison: normalired sequence alignment

Authors

Arslan, AN Egecioglu, O Pevzner, PA

Citation

An. Arslan et al., A new approach to sequence comparison: normalired sequence alignment, BIOINFORMAT, 17(4), 2001, pp. 327-337

Citations number

Categorie Soggetti

Multidisciplinary

Journal title

BIOINFORMATICS

ISSN journal

13674803 → ACNP

Volume

Issue

Year of publication

2001

Pages

327 - 337

Database

ISI

SICI code

1367-4803(200104)17:4<327:ANATSC>2.0.ZU;2-E

Abstract

The Smith-Waterman algorithm for local sequence alignment is one of the mos t important techniques in computational molecular biology. This ingenious d ynamic programming approach was designed to reveal the highly conserved fra gments by discarding poorly conserved initial and terminal segments. Howeve r, the existing notion of local similarity has a serious flaw: it does not discard poorly conserved intermediate segments. The Smith-Waterman algorith m finds the local alignment with maximal score but it is unable to find loc al alignment with maximum degree of similarity (e,g, maximal percent of mat ches). Moreover, there is still no efficient algorithm that answers the fol lowing natural question: do two sequences share a (sufficiently long) fragm ent with more than 70% of similarity? As a result, the local alignment some times produces a mosaic of well-conserved fragments artificially connected by poorly-conserved or even unrelated fragments. This may lead to problems in comparison of long genomic sequences and comparative gene prediction as recently pointed out by Zhang et al, (Bioinformatics, 15, 1012-1019, 1999). In this paper we propose a new sequence comparison algorithm (normalized l ocal alignment) that reports the regions with maximum degree of similarity. The algorithm is based on fractional programming and its running time is O (n(2) log n). In practice, normalized local alignment is only 3-5 times slo wer than the standard Smith-Waterman algorithm.