ITA
ENG

COMPUTER RECOGNITION OF PRINTED BANGLA SCRIPT

Authors

PAL U CHAUDHURI BB

Citation

U. Pal et Bb. Chaudhuri, COMPUTER RECOGNITION OF PRINTED BANGLA SCRIPT, International Journal of Systems Science, 26(11), 1995, pp. 2107-2123

Citations number

Categorie Soggetti

System Science","Computer Science Theory & Methods","Operatione Research & Management Science

Journal title

International Journal of Systems Science → ACNP

ISSN journal

00207721

Volume

Issue

Year of publication

1995

Pages

2107 - 2123

Database

ISI

SICI code

0020-7721(1995)26:11<2107:CROPBS>2.0.ZU;2-R

Abstract

This paper considers optical character recognition (OCR) of Bangla, th e second most popular script in the Indian subcontinent. A complete OC R system is described for documents of single Bangla font, where more than three hundred character shapes are recognized by a combination of template and feature-matching approach. Here the document image captu red by a flatbed scanner is subject to tilt correction, line, word and character segmentation, simple and compound character separation, fea ture extraction and finally character recognition. Some character occu rrence statistics have been computed to aid the recognition process. T he simple character recognition is done by a feature-based tree classi fier, and the compound character recognition involves a template match ing approach preceded by a feature-based grouping. At present, recogni tion accuracy of about 96% is obtained by the system.