Automatic caption localization in compressed video

Citation
Y. Zhong et al., Automatic caption localization in compressed video, IEEE PATT A, 22(4), 2000, pp. 385-392
Number of citations
22
Subject categories
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
ISSN journal
0162-8828
Volume
22
Issue
4
Year of publication
2000
Pages
385 - 392
Database
ISI
SICI code
0162-8828(200004)22:4<385:ACLICV>2.0.ZU;2-G
Abstract
We present a method to automatically localize captions in JPEG compressed images and the I-frames of MPEG compressed videos. Caption text regions are segmented from background images using their distinguishing texture characteristics. Unlike previously published methods which fully decompress the video sequence before extracting the text regions, this method locates candidate caption text regions directly in the DCT compressed domain using the intensity variation information encoded in the DCT domain. Therefore, only a very small amount of decoding is required. The proposed algorithm takes about 0.006 second to process a 240 x 350 image and achieves a recall rate of 99.17 percent while falsely accepting about 1.87 percent nontext DCT blocks on a variety of MPEG compressed videos containing more than 2,300 I-frames.
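
Illustrative sketch
The abstract does not say which DCT coefficients are used or how blocks are thresholded, so the following Python sketch is only an illustration of the general idea of flagging 8 x 8 blocks by the intensity variation encoded in their AC coefficients. It is not the authors' algorithm. The function names, the energy measure (sum of absolute AC coefficients), and the threshold value are all assumptions made for this example, and the small DCT routine only stands in for the coefficients a real decoder would supply after entropy decoding.

```python
import math

N = 8  # block size used by JPEG and MPEG I-frames


def dct2_block(block):
    """2-D DCT-II of an N x N pixel block.

    Stand-in for coefficients read from a compressed stream; included
    only so the example is self-contained.
    """
    def scale(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = scale(u) * scale(v) * s
    return out


def texture_energy(coeffs):
    """Assumed measure: sum of absolute AC coefficients (DC term excluded)."""
    total = sum(abs(coeffs[u][v]) for u in range(N) for v in range(N))
    return total - abs(coeffs[0][0])


def candidate_text_blocks(coeff_grid, threshold):
    """Mark blocks whose texture energy exceeds a (hypothetical) threshold."""
    return [[texture_energy(c) > threshold for c in row] for row in coeff_grid]


# Demo: a flat background block versus a block with sharp vertical strokes.
flat = [[128] * N for _ in range(N)]
striped = [[255 if y % 2 == 0 else 0 for y in range(N)] for _ in range(N)]

grid = [[dct2_block(flat), dct2_block(striped)]]
mask = candidate_text_blocks(grid, threshold=50.0)  # threshold is arbitrary
```

In this demo the flat block has essentially zero AC energy and the striped block has a large one, so only the second block is flagged. A practical system would read the coefficients directly from the bitstream rather than recomputing them, and would need further steps to merge flagged blocks into caption regions.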