ITA
ENG

Efficient algorithms for similarity search

Authors

Rajasekaran, S Hu, Y Luo, J Nick, H Pardalos, PM Sahni, S Shaw, G

Citation

S. Rajasekaran et al., Efficient algorithms for similarity search, J COMB OPTI, 5(1), 2001, pp. 125-132

Citations number

Categorie Soggetti

Mathematics,"Engineering Mathematics

Journal title

JOURNAL OF COMBINATORIAL OPTIMIZATION

ISSN journal

13826905 → ACNP

Volume

Issue

Year of publication

2001

Pages

125 - 132

Database

ISI

SICI code

1382-6905(200103)5:1<125:EAFSS>2.0.ZU;2-1

Abstract

The problem of our interest takes as input a database of m sequences from a n alphabet Sigma and an integer k. The goal is to report all the pairs of s equences that have a matching subsequence of length at least k. We employ t wo algorithms to solve this problem. The first algorithm is based on sortin g and the second is based on generalized suffix trees. We provide experimen tal data comparing the performances of these algorithms. The generalized su ffix tree based algorithm performs better than the sorting based algorithm.