Document segmentation using polynomial spline wavelets

Citation
Sl. Deng et al., Document segmentation using polynomial spline wavelets, PATT RECOG, 34(12), 2001, pp. 2533-2545
Citations number
14
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
PATTERN RECOGNITION
ISSN journal
00313203 → ACNP
Volume
34
Issue
12
Year of publication
2001
Pages
2533 - 2545
Database
ISI
SICI code
0031-3203(200112)34:12<2533:DSUPSW>2.0.ZU;2-W
Abstract
Wavelet transforms have been widely used as effective tools in texture segm entation in the past decade. Segmentation of document images, which usually contain three types of texture information: text, picture and background, can be regarded as a special case of texture segmentation. B-spline wavelet s possess some desirable properties such as being well localized in time an d frequency, and being compactly supported, which make them an effective to ol for texture analysis. Based on the observation that text textures provid e fast-changed and relatively regular distributed edges in the wavelet tran sform domain, an efficient document segmentation algorithm is designed via cubic B-spline wavelets. Three-means or two-means classification is applied for classifying pixels with similar characteristics after feature estimati on at the outputs of high frequency bands of spline wavelet transforms. We examine and evaluate the contributions of different factors to the segmenta tion results from the viewpoints of decomposition levels, frequency bands a nd wavelet functions. Further performance analysis reveals the advantages o f the proposed method. (C) 2001 Pattern Recognition Society. Published by E lsevier Science Ltd. Ail rights reserved.