DOCUMENT IMAGE-PROCESSING SYSTEM FOR NAME AND ADDRESS RECOGNITION

Citation
Sn. Srihari et al., DOCUMENT IMAGE-PROCESSING SYSTEM FOR NAME AND ADDRESS RECOGNITION, International journal of imaging systems and technology, 7(4), 1996, pp. 379-391
Citations number
31
Categorie Soggetti
Optics,"Engineering, Eletrical & Electronic
ISSN journal
08999457
Volume
7
Issue
4
Year of publication
1996
Pages
379 - 391
Database
ISI
SICI code
0899-9457(1996)7:4<379:DISFNA>2.0.ZU;2-R
Abstract
This article describes a real-time document image processing system. I ts objective is to recognize names and addresses from scanned address block images extracted from various tax forms of the United States Int ernal Revenue Service. The Name and Address Block Reader (NABR) system accepts both machine- and hand-printed address block images as input. Salient aspects of the system are presented, including document analy sis (connected component analysis, address block extraction, label det ection, hand-print/machine-print discrimination) and document recognit ion. Document recognition is performed in two nonidentical streams for machine-and hand-print; key steps are address parsing, character reco gnition, word recognition, and postal data base lookup (ZIP+4 and city -state-ZIP files). System output is a packet containing the results of recognition together with data base access status tile, Real-time thr oughput (8500 forms/h) is achieved by employing a loosely coupled mult iprocessing architecture where successive input images are distributed to available address recognition processors, The functional architect ure, software design, system architecture, and the hardware implementa tion are described. Performance evaluation on machine- and hand-writte n addresses are presented. (C) 1996 John Wiley & Sons, Inc.