Adaptive document image binarization

Citation
J. Sauvola and M. Pietikainen, Adaptive document image binarization, PATT RECOG, 33(2), 2000, pp. 225-236
Number of citations
24
Subject categories
AI Robotics and Automatic Control
Journal title
PATTERN RECOGNITION
Journal ISSN
0031-3203
Volume
33
Issue
2
Year of publication
2000
Pages
225 - 236
Database
ISI
SICI code
0031-3203(200002)33:2<225:ADIB>2.0.ZU;2-L
Abstract
A new method is presented for adaptive document image binarization, where the page is considered as a collection of subcomponents such as text, background and picture. The problems caused by noise, illumination and many source type-related degradations are addressed. Two new algorithms are applied to determine a local threshold for each pixel. The performance evaluation of the algorithm utilizes test images with ground-truth, evaluation metrics for binarization of textual and synthetic images, and a weight-based ranking procedure for the final result presentation. The proposed algorithms were tested with images including different types of document components and degradations. The results were compared with a number of known techniques in the literature. The benchmarking results show that the method adapts and performs well in each case qualitatively and quantitatively. (C) 1999 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.