Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences

Citation
I. Kompatsiaris et Mg. Strintzis, Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences, IEEE CIR SV, 10(8), 2000, pp. 1388-1402
Citations number
46
Categorie Soggetti
Eletrical & Eletronics Engineeing
Journal title
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
ISSN journal
10518215 → ACNP
Volume
10
Issue
8
Year of publication
2000
Pages
1388 - 1402
Database
ISI
SICI code
1051-8215(200012)10:8<1388:SSATOO>2.0.ZU;2-5
Abstract
In this paper, a procedure is described for the segmentation, content-based coding, and visualization of videoconference image sequences. First, image sequence analysis is used to estimate the shape and motion parameters of t he person facing the camera. A spatiotemporal filter, taking into account t he intensity differences between consequent frames, is applied, in order to separate the moving person from the static background. The foreground is s egmented in a number of regions in order to identify the face. For this pur pose, we propose the novel procedure of K-Means with connectivity constrain t algorithm as a general segmentation algorithm combining several types of information including intensity, motion and compactness. In this algorithm, the use of spatiotemporal regions is introduced since a number of frames a re analyzed simultaneously and as a result, the same region is present in c onsequent frames. Based on this information, a 3-D ellipsoid is adapted to the person's face using an efficient and robust algorithm. The rigid 3-D mo tion is estimated next using a least median of squares approach. Finally, a Virtual Reality Modeling Language (VRML) file is created containing all th e above information; this file may he viewed by using any VRML 2.0 complian t browser.