Efficient algorithms for similarity search

Citation
S. Rajasekaran et al., Efficient algorithms for similarity search, J COMB OPTI, 5(1), 2001, pp. 125-132
Citations number
5
Categorie Soggetti
Mathematics,"Engineering Mathematics
Journal title
JOURNAL OF COMBINATORIAL OPTIMIZATION
ISSN journal
13826905 → ACNP
Volume
5
Issue
1
Year of publication
2001
Pages
125 - 132
Database
ISI
SICI code
1382-6905(200103)5:1<125:EAFSS>2.0.ZU;2-1
Abstract
The problem of our interest takes as input a database of m sequences from a n alphabet Sigma and an integer k. The goal is to report all the pairs of s equences that have a matching subsequence of length at least k. We employ t wo algorithms to solve this problem. The first algorithm is based on sortin g and the second is based on generalized suffix trees. We provide experimen tal data comparing the performances of these algorithms. The generalized su ffix tree based algorithm performs better than the sorting based algorithm.