SHORTEST COMMON SUPERSTRINGS OF RANDOM STRINGS

Authors
Citation
Ks. Alexander, SHORTEST COMMON SUPERSTRINGS OF RANDOM STRINGS, Journal of Applied Probability, 33(4), 1996, pp. 1112-1126
Citations number
8
Categorie Soggetti
Statistic & Probability","Statistic & Probability
ISSN journal
00219002
Volume
33
Issue
4
Year of publication
1996
Pages
1112 - 1126
Database
ISI
SICI code
0021-9002(1996)33:4<1112:SCSORS>2.0.ZU;2-L
Abstract
Given a finite collection of strings of letters from a fixed alphabet, it is of interest, in the contexts of data compression and DNA sequen cing, to find the length of the shortest string which contains each of the given strings as a consecutive substring. In order to analyze the average behavior of the optimal superstring length, substrings of spe cified lengths are considered with the letters selected independently at random. An asymptotic expression is obtained for the savings from c ompression, i.e. the difference between the uncompressed (concatenated ) length and the optimal superstring length.