ITA
ENG

Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences

Authors

Kompatsiaris, I Strintzis, MG

Citation

I. Kompatsiaris et Mg. Strintzis, Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences, IEEE CIR SV, 10(8), 2000, pp. 1388-1402

Citations number

Categorie Soggetti

Eletrical & Eletronics Engineeing

Journal title

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN journal

10518215 → ACNP

Volume

Issue

Year of publication

2000

Pages

1388 - 1402

Database

ISI

SICI code

1051-8215(200012)10:8<1388:SSATOO>2.0.ZU;2-5

Abstract

In this paper, a procedure is described for the segmentation, content-based coding, and visualization of videoconference image sequences. First, image sequence analysis is used to estimate the shape and motion parameters of t he person facing the camera. A spatiotemporal filter, taking into account t he intensity differences between consequent frames, is applied, in order to separate the moving person from the static background. The foreground is s egmented in a number of regions in order to identify the face. For this pur pose, we propose the novel procedure of K-Means with connectivity constrain t algorithm as a general segmentation algorithm combining several types of information including intensity, motion and compactness. In this algorithm, the use of spatiotemporal regions is introduced since a number of frames a re analyzed simultaneously and as a result, the same region is present in c onsequent frames. Based on this information, a 3-D ellipsoid is adapted to the person's face using an efficient and robust algorithm. The rigid 3-D mo tion is estimated next using a least median of squares approach. Finally, a Virtual Reality Modeling Language (VRML) file is created containing all th e above information; this file may he viewed by using any VRML 2.0 complian t browser.