A COMPARISON OF APPROXIMATE STRING-MATCHING ALGORITHMS

Citation
P. Jokinen et al., A COMPARISON OF APPROXIMATE STRING-MATCHING ALGORITHMS, Software, practice & experience, 26(12), 1996, pp. 1439-1458
Citations number
26
Categorie Soggetti
Computer Sciences","Computer Science Software Graphycs Programming
ISSN journal
00380644
Volume
26
Issue
12
Year of publication
1996
Pages
1439 - 1458
Database
ISI
SICI code
0038-0644(1996)26:12<1439:ACOASA>2.0.ZU;2-G
Abstract
Experimental comparisons of the running time of approximate string mat ching algorithms for the k differences problem are presented. Given a pattern string, a text string, and an integer k, the task is to find a ll approximate occurrences of the pattern in the text with at most k d ifferences (insertions, deletions, changes). We consider seven algorit hms based on different approaches including dynamic programming, Boyer -Moore string matching, suffix automata, and the distribution of chara cters. It turns out that none of the algorithms is the best for all va lues of the problem parameters, and the speed differences between the methods can be considerable.