Because of the media digitization, a large amount of information such as sp
eech, audio and video data is produced everyday. In order to retrieve data
from these databases quickly and precisely, multimedia technologies for str
ucturing and retrieving of speech, audio and video data are strongly requir
ed. In this paper, we overview the multimedia technologies such as structur
ing and retrieval of speech, audio and video data, speaker indexing, audio
summarization and cross media retrieval existing today for TV news database
. The main purpose of structuring is to produce tables of contents and indi
ces from audio and video data automatically. In order to make these technol
ogies feasible, first, processing units such as words on audio data and sho
ts on video data are extracted. On a second step, they are meaningfully int
egrated into topics. Furthermore, the units extracted from different types
of media are integrated for higher functions.