COMPRESSION AND EXPLANATION USING HIERARCHICAL GRAMMARS

Citation
Cg. Nevillmanning et Ih. Witten, COMPRESSION AND EXPLANATION USING HIERARCHICAL GRAMMARS, Computer journal, 40(2-3), 1997, pp. 103-116
Citations number
18
Categorie Soggetti
Computer Sciences","Computer Science Hardware & Architecture","Computer Science Information Systems","Computer Science Software Graphycs Programming
Journal title
ISSN journal
00104620
Volume
40
Issue
2-3
Year of publication
1997
Pages
103 - 116
Database
ISI
SICI code
0010-4620(1997)40:2-3<103:CAEUHG>2.0.ZU;2-C
Abstract
This paper describes an algorithm, called SEQUITUR, that identifies hi erarchical structure in sequences of discrete symbols and uses that in formation for compression, On many practical sequences it performs wel l at both compression and structural inference, producing comprehensib le descriptions of sequence structure in the form of grammar rules, Th e algorithm can be stated concisely in the form of two constraints on a context-free grammar, Inference is performed incrementally, the stru cture faithfully representing the input at all times, It can be implem ented efficiently and operates in a time that is approximately linear in sequence length, Despite its simplicity and efficiency, SEQUITUR su cceeds in inferring a range of interesting hierarchical structures fro m naturally occurring sequences.