Parameter-free geometric document layout analysis

Authors
Citation
Sw. Lee and Ds. Ryu, Parameter-free geometric document layout analysis, IEEE PATT A, 23(11), 2001, pp. 1240-1256
Number of citations
18
Subject Categories
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
ISSN journal
0162-8828
Volume
23
Issue
11
Year of publication
2001
Pages
1240 - 1256
Database
ISI
SICI code
0162-8828(200111)23:11<1240:PGDLA>2.0.ZU;2-P
Abstract
Automatic transformation of paper documents into electronic documents requires geometric document layout analysis at the first stage. However, variations in character font sizes, text line spacing, and document layout structures have made it difficult to design a general-purpose document layout analysis algorithm for many years. The use of some parameters has therefore been unavoidable in previous methods. In this paper, we propose a parameter-free method for segmenting the document images into maximal homogeneous regions and identifying them as texts, images, tables, and ruling lines. A pyramidal quadtree structure is constructed for multiscale analysis and a periodicity measure is suggested to find a periodical attribute of text regions for page segmentation. To obtain robust page segmentation results, a confirmation procedure using texture analysis is applied to only ambiguous regions. Based on the proposed periodicity measure, multiscale analysis, and confirmation procedure, we could develop a robust method for geometric document layout analysis independent of character font sizes, text line spacing, and document layout structures. The proposed method was experimented with the document database from the University of Washington and the MediaTeam Document Database. The results of these tests have shown that the proposed method provides more accurate results than the previous ones.
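
Illustrative sketch
The abstract does not define the periodicity measure, so the following is only a rough illustration of the general idea that text regions show a periodic pattern of ink and white space across text lines. It assumes a binary region given as a list of rows (1 = ink) and uses the peak of the normalized autocorrelation of the horizontal projection profile as a stand-in score. The function names, the autocorrelation choice, and the synthetic region are all assumptions made for this example and are not taken from the paper.

```python
def projection_profile(region):
    """Count ink pixels in each row of a binary region (list of rows, 1 = ink)."""
    return [sum(row) for row in region]


def periodicity_score(profile):
    """Return (best_lag, score) for a projection profile.

    score is the largest normalized autocorrelation of the mean-removed
    profile over lags 1 .. len(profile) // 2. This is an assumed stand-in
    measure, not the measure proposed in the paper.
    """
    n = len(profile)
    mean = sum(profile) / n
    centered = [p - mean for p in profile]
    energy = sum(c * c for c in centered)
    if energy == 0:
        # A flat profile (e.g. a solid block) has no periodic structure.
        return 0, 0.0
    best_lag, best = 0, 0.0
    for lag in range(1, n // 2 + 1):
        corr = sum(centered[i] * centered[i + lag] for i in range(n - lag)) / energy
        if corr > best:
            best_lag, best = lag, corr
    return best_lag, best


# Synthetic "text-like" region: 3 ink rows followed by 2 blank rows, 8 times.
text_like = ([[1] * 20] * 3 + [[0] * 20] * 2) * 8
# Synthetic "solid" region: every row fully inked.
solid = [[1] * 20] * 40

text_lag, text_score = periodicity_score(projection_profile(text_like))
solid_lag, solid_score = periodicity_score(projection_profile(solid))
```

On the synthetic text-like region the strongest lag equals the line pitch of 5 rows, while the solid region scores zero. Note that this sketch still fixes a lag search range, so it does not by itself reproduce the parameter-free property claimed in the abstract.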