Integrated information mining for texts, images, and videos

Citation
O. Herzog et al., Integrated information mining for texts, images, and videos, COMPUT GRAP, 22(6), 1998, pp. 675-685
Citations number
31
Categorie Soggetti
Computer Science & Engineering
Journal title
COMPUTERS & GRAPHICS
ISSN journal
00978493 → ACNP
Volume
22
Issue
6
Year of publication
1998
Pages
675 - 685
Database
ISI
SICI code
0097-8493(199812)22:6<675:IIMFTI>2.0.ZU;2-B
Abstract
The large amount and the ubiquitous availability of multimedia information (e.g., video, audio, image, and also text documents) require efficient, eff ective, and automatic annotation and retrieval methods. As videos start to play an even more important role in multimedia, content-based retrieval of videos becomes an issue, especially as there should be an integrated method ology for all types of multimedia documents. Our approach for the integrated retrieval of videos, images, and text compr ises three necessary steps: First, the detection and extraction of shots fr om a video, second. the construction of a still image from the frames in a shot. This is achieved by an extraction of ky frames or a mosaicing techniq ue. The result is a single image visualization of a shot, which in turn can be analyzed by the ImageMiner double dagger(TM) system. The ImageMiner system was developed in cooperation with IBM at the Universi ty of Bremen in the Image Processing Department of the Center for Computing Technologies. It realizes the content-based retrieval of single images thr ough a novel combination of techniques and methods from computer vision and artificial intelligence. Its output is a textual description of an image, and thus in our case, of the static elements of a video shot. In this way, the annotations of a video can be indexed with standard text retrieval syst ems, along with text documents or annotations of other multimedia documents , thus ensuring an integrated interface for all kinds of multimedia documen ts. (C) 1999 Elsevier Science Ltd. All rights reserved.