FAST ALGORITHMS FOR APPROXIMATELY COUNTING MISMATCHES

Authors
Citation
H. Karloff, FAST ALGORITHMS FOR APPROXIMATELY COUNTING MISMATCHES, Information processing letters, 48(2), 1993, pp. 53-60
Citations number
11
Categorie Soggetti
Information Science & Library Science","Computer Applications & Cybernetics
ISSN journal
00200190
Volume
48
Issue
2
Year of publication
1993
Pages
53 - 60
Database
ISI
SICI code
0020-0190(1993)48:2<53:FAFACM>2.0.ZU;2-5
Abstract
Given a text string T is-an-element-of SIGMA(n) and a pattern string P is-an-element-of SIGMA(m), for each i = 1, 2,...,n-m + 1 define f(i) to be the number of mismatches when the pattern is aligned below the t ext and starts in position i of the text. The fastest known algorithm to compute all the f(i)'s when the alphabet is arbitrary has worst-cas e time exceeding n square-root m. For any epsilon > 0, we give simple randomized and deterministic algorithms that compute g(i) such that f( i) less-than-or-equal-to g(i) less-than-or-equal-to f(i)(1 + epsilon) for all i. The algorithms run in time O((n/epsilon2) log(c)m) for a sm all universal constant c and are easy reductions from the case of arbi trary SIGMA to the case of SIGMA = (0, 1}.