Cursive handwriting recognition using hidden Markov models and a lexicon-driven level building algorithm

Citation
S. Procter et al., Cursive handwriting recognition using hidden Markov models and a lexicon-driven level building algorithm, IEE P-VIS I, 147(4), 2000, pp. 332-339
Citations number
28
Categorie Soggetti
Eletrical & Eletronics Engineeing
Journal title
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING
ISSN journal
1350245X → ACNP
Volume
147
Issue
4
Year of publication
2000
Pages
332 - 339
Database
ISI
SICI code
1350-245X(200008)147:4<332:CHRUHM>2.0.ZU;2-K
Abstract
The authors describe a method for the recognition of cursively handwritten words using hidden Markov models (HMMs). The modelling methodology used has previously been successfully applied to the recognition of both degraded m achine-printed text and hand-printed numerals. A novel lexicon-driven level building (LDLB) algorithm is proposed, which incorporates a lexicon direct ly within the search procedure and maintains a list of plausible match sequ ences at each stage of the search, rather than decoding using only the most likely state sequence. A word recognition rate of 93.4% is achieved using a 713 word lexicon, compared to just 49.8% when the same lexicon is used to post-process the results produced by a standard level building algorithm. Various procedures are described for the normalisation of cursive script. R esults are presented on a single-author database of scanned text. It is sho wn how very high reliability up to near perfect recognition, can be achieve d by using a threshold to reject those word hypotheses to which the system assigns a low confidence. At 19% rejection, 99.2% of accepted words appeare d in the top two choices produced by the system, and 100% of the 1645 accep ted words were correctly recognised within the top eight choices.