Geometric structure analysis of document images: A knowledge-based approach

Citation
Kh. Lee et al., Geometric structure analysis of document images: A knowledge-based approach, IEEE PATT A, 22(11), 2000, pp. 1224-1240
Citations number
43
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
ISSN journal
01628828 → ACNP
Volume
22
Issue
11
Year of publication
2000
Pages
1224 - 1240
Database
ISI
SICI code
0162-8828(200011)22:11<1224:GSAODI>2.0.ZU;2-G
Abstract
Geometric structure analysis is a prerequisite to create electronic documen ts from logical components extracted from document images. This paper prese nts a knowledge-based method for sophisticated geometric structure analysis of technical journal pages. The proposed knowledge base encodes geometric characteristics that are not only common in technical journals but also pub lication-specific in the farm of rules. The method takes the hybrid of top- down and bottom-up techniques and consists of two phases: region segmentati on and identification. Generally, the result of the segmentation process do es not have a one-to-one matching with composite layout components. Therefo re, the proposed method identifies nontext objects, such as images drawings , and tables, as well as text objects, such as text lines and equations, by splitting or grouping segmented regions into composite layout components. Experimental results with 372 images scanned from the IEEE Transactions on Pattern Analysis and Machine Intelligence show that the proposed method has performed geometric structure analysis successfully on more than 99 percen t of the test images, resulting in impressive performance compared with pre vious works.