A SYSTEM TO READ NAMES AND ADDRESSES ON TAX FORMS

Citation
Sn. Srihari et al., A SYSTEM TO READ NAMES AND ADDRESSES ON TAX FORMS, Proceedings of the IEEE, 84(7), 1996, pp. 1038-1049
Citations number
31
Categorie Soggetti
Engineering, Eletrical & Electronic
Journal title
ISSN journal
00189219
Volume
84
Issue
7
Year of publication
1996
Pages
1038 - 1049
Database
ISI
SICI code
0018-9219(1996)84:7<1038:ASTRNA>2.0.ZU;2-G
Abstract
The reading of names and addresses is one of the most complex tasks in automated forms processing. This paper describes an integrated real-t ime system to read names and addresses on tax forms of the U.S. Intern al Revenue Service. The Name and Address Block Reader (NABR) system ac cepts both machine-printed and hand-printed address block images as in put. The application software has two major steps: document analysis ( connected component analysis, address block extraction, label detectio n, hand-print/machine-print discrimination) and document recognition. Document recognition has two nonidentical streams for machine-print an d hand-print: the key steps are address parsing, character recognition , weld recognition, and postal database lookup, (ZIP+4 and City-State- ZIP files.) System output is a packet containing the results of recogn ition together with database access status file. Real-time throughput (8500 forms/h) is achieved by employing a loosely coupled multiprocess ing architecture where successive input images are distributed to avai lable address recognition processors. The functional architecture, sof tware design, system architecture, and the hardware implementation are described. Performance evaluation on machine-printed and handwritten addresses are presented.