As digital video databases become more and more pervasive, finding vid
eo in large databases becomes a major problem, Because of the nature o
f video (streamed objects), accessing the content of such databases is
inherently a time-consuming operation. Enabling intelligent means of
video retrieval and rapid video viewing through the processing, analys
is, and interpretation of visual content are, therefore, important top
ics of research, In this paper, we survey the art of video query and r
etrieval and propose a framework for video-query formulation and video
retrieval based on an iterated sequence of navigating, searching, bro
wsing, and viewing, We describe how the rich information media of vide
o in the forms of image, audio, and text can be appropriately used in
each stage of the search process to retrieve relevant segments. Also,
we address the problem of automatic video annotation-attaching meaning
s to video segments to aid the query steps. Subsequently, we present a
novel framework of structural video analysis that focuses on the proc
essing of high-level features as well as low-level visual cues. This p
rocessing augments the semantic interpretation of a wide variety of lo
ng video segments and assists in the search, navigation, and retrieval
of video. We describe several such techniques.