J. Hochberg et al., AUTOMATIC SCRIPT IDENTIFICATION FROM DOCUMENT IMAGES USING CLUSTER-BASED TEMPLATES, IEEE transactions on pattern analysis and machine intelligence, 19(2), 1997, pp. 176-181
We describe an automated script identification system for typeset docu
ment images. Templates for each script are created by clustering textu
al symbols from a training set. Symbols from new images are compared t
o the templates to find the best script. Our current system processes
thirteen scripts with minimal preprocessing and high accuracy.