ITA
ENG

UNBOUNDED LENGTH CONTEXTS FOR PPM

Authors

CLEARY JG TEAHAN WJ

Citation

Jg. Cleary et Wj. Teahan, UNBOUNDED LENGTH CONTEXTS FOR PPM, Computer journal, 40(2-3), 1997, pp. 67-75

Citations number

Categorie Soggetti

Computer Sciences","Computer Science Hardware & Architecture","Computer Science Information Systems","Computer Science Software Graphycs Programming

Journal title

Computer journal → ACNP

ISSN journal

00104620

Volume

Issue

2-3

Year of publication

1997

Pages

67 - 75

Database

ISI

SICI code

0010-4620(1997)40:2-3<67:ULCFP>2.0.ZU;2-Z

Abstract

The PPM data compression scheme has set the performance standard in lo ssless compression of text throughout the past decade, PPM is a finite context statistical modelling technique that can be viewed as blendin g together several fixed-order context models to predict the next char acter in the input sequence, This paper gives a brief introduction to PPM, and describes a variant of the algorithm, called PPM, which expl oits contexts of unbounded length. Although requiring considerably gre ater computational resources (in both time and space), this reliably a chieves compression superior to the benchmark PPMC version, Its major contribution is that it shows that the full. information available by considering all substrings of the input string can be used effectively to generate high-quality predictions, Hence, it provides a useful too l for exploring the bounds of compression.