MULTILEVEL POST-PROCESSING FOR KOREAN CHARACTER-RECOGNITION USING MORPHOLOGICAL ANALYSIS AND LINGUISTIC EVALUATION

Authors
Citation
G. Lee et al., MULTILEVEL POST-PROCESSING FOR KOREAN CHARACTER-RECOGNITION USING MORPHOLOGICAL ANALYSIS AND LINGUISTIC EVALUATION, Pattern recognition, 30(8), 1997, pp. 1347-1360
Citations number
20
Categorie Soggetti
Computer Sciences, Special Topics","Engineering, Eletrical & Electronic","Computer Science Artificial Intelligence
Journal title
ISSN journal
00313203
Volume
30
Issue
8
Year of publication
1997
Pages
1347 - 1360
Database
ISI
SICI code
0031-3203(1997)30:8<1347:MPFKCU>2.0.ZU;2-X
Abstract
Most of the post-processing methods for character recognition rely on contextual information of character and word-fragment levels. However, due to linguistic characteristics of Korean, such low-level informati on alone is not sufficient for high-quality character-recognition appl ications, and we need much higher-level contextual information to impr ove the recognition results. This paper presents a domain independent postprocessing technique that utilizes multi-level morphological, synt actic, and semantic information as well as character-level information . The proposed post-processing system performs three-level processing: candidate character-set selection, candidate eojeol (Korean word) gen eration through morphological analysis, and final single eojeol-sequen ce selection by linguistic evaluation. All the required linguistic inf ormation and probabilities are automatically acquired from a statistic al corpus analysis. Experimental results demonstrate the effectiveness of our method, yielding an error correction rate of 80.46%, and impro ved recognition rate of 95.53% from the before-post-processing rate of 71.2% for single best-solution selection. (C) 1997 Pattern Recognitio n Society. Published by Elsevier Science Ltd.