UNBOUNDED LENGTH CONTEXTS FOR PPM

Citation
Jg. Cleary et Wj. Teahan, UNBOUNDED LENGTH CONTEXTS FOR PPM, Computer journal, 40(2-3), 1997, pp. 67-75
Citations number
25
Categorie Soggetti
Computer Sciences","Computer Science Hardware & Architecture","Computer Science Information Systems","Computer Science Software Graphycs Programming
Journal title
ISSN journal
00104620
Volume
40
Issue
2-3
Year of publication
1997
Pages
67 - 75
Database
ISI
SICI code
0010-4620(1997)40:2-3<67:ULCFP>2.0.ZU;2-Z
Abstract
The PPM data compression scheme has set the performance standard in lo ssless compression of text throughout the past decade, PPM is a finite context statistical modelling technique that can be viewed as blendin g together several fixed-order context models to predict the next char acter in the input sequence, This paper gives a brief introduction to PPM, and describes a variant of the algorithm, called PPM, which expl oits contexts of unbounded length. Although requiring considerably gre ater computational resources (in both time and space), this reliably a chieves compression superior to the benchmark PPMC version, Its major contribution is that it shows that the full. information available by considering all substrings of the input string can be used effectively to generate high-quality predictions, Hence, it provides a useful too l for exploring the bounds of compression.