COMPUTER RECOGNITION OF PRINTED BANGLA SCRIPT

Citation
U. Pal et Bb. Chaudhuri, COMPUTER RECOGNITION OF PRINTED BANGLA SCRIPT, International Journal of Systems Science, 26(11), 1995, pp. 2107-2123
Citations number
23
Categorie Soggetti
System Science","Computer Science Theory & Methods","Operatione Research & Management Science
ISSN journal
00207721
Volume
26
Issue
11
Year of publication
1995
Pages
2107 - 2123
Database
ISI
SICI code
0020-7721(1995)26:11<2107:CROPBS>2.0.ZU;2-R
Abstract
This paper considers optical character recognition (OCR) of Bangla, th e second most popular script in the Indian subcontinent. A complete OC R system is described for documents of single Bangla font, where more than three hundred character shapes are recognized by a combination of template and feature-matching approach. Here the document image captu red by a flatbed scanner is subject to tilt correction, line, word and character segmentation, simple and compound character separation, fea ture extraction and finally character recognition. Some character occu rrence statistics have been computed to aid the recognition process. T he simple character recognition is done by a feature-based tree classi fier, and the compound character recognition involves a template match ing approach preceded by a feature-based grouping. At present, recogni tion accuracy of about 96% is obtained by the system.