This paper demonstrates a new application of computer vision to digital libraries: the use of texture for annotation, the description of content. Vision-based annotation assists the user in attaching descriptions to large sets of images and video. If a user labels a piece of an image as "water," a texture model can be used to propagate this label to other "visually similar" regions. However, a serious problem is that no single model has been found that reliably matches human perception of similarity in pictures. Rather than using one model, the system described here knows several texture models and is equipped with the ability to choose the one that "best explains" the regions selected by the user for annotation. If none of these models suffices, it creates new explanations by combining models. Examples of annotations propagated by the system on natural scenes are given. The system provides an average gain of four to one in label prediction for a set of 98 images.
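The select-then-propagate loop described above can be sketched in code. This is a minimal illustration, not the paper's actual system: the three toy feature extractors, the tightest-cluster selection criterion, and the distance threshold are all assumptions standing in for the paper's richer texture models and "best explains" machinery.

```python
import numpy as np

# Hypothetical texture "models": each maps an image patch to a feature vector.
# These toy extractors stand in for the multiple texture models the paper describes.
def mean_intensity(patch):
    return np.array([patch.mean()])

def intensity_variance(patch):
    return np.array([patch.var()])

def gradient_energy(patch):
    gy, gx = np.gradient(patch.astype(float))
    return np.array([(gx ** 2 + gy ** 2).mean()])

MODELS = {"mean": mean_intensity, "var": intensity_variance, "grad": gradient_energy}

def best_model(labeled_patches):
    """Pick the model under which the user-labeled patches cluster most
    tightly (smallest relative spread), i.e. the one that best explains them."""
    best_name, best_spread = None, np.inf
    for name, feature in MODELS.items():
        feats = np.array([feature(p) for p in labeled_patches])
        spread = feats.std() / (abs(feats.mean()) + 1e-9)  # scale-invariant spread
        if spread < best_spread:
            best_name, best_spread = name, spread
    return best_name

def propagate(labeled_patches, candidates, threshold=0.2):
    """Return indices of candidate patches whose feature, under the chosen
    model, lies within `threshold` relative distance of the labeled mean."""
    feature = MODELS[best_model(labeled_patches)]
    center = np.mean([feature(p) for p in labeled_patches], axis=0)
    scale = abs(center).sum() + 1e-9
    return [i for i, c in enumerate(candidates)
            if np.linalg.norm(feature(c) - center) / scale < threshold]

# Illustrative usage: two smooth "water-like" example patches, then two
# candidates, of which only the first resembles the labeled examples.
rng = np.random.default_rng(0)
labeled = [rng.normal(10, 0.5, (8, 8)), rng.normal(10, 0.5, (8, 8))]
candidates = [rng.normal(10, 0.5, (8, 8)), rng.normal(100, 20, (8, 8))]
matched = propagate(labeled, candidates)
```

Here the label attached to the example patches would spread only to the visually similar first candidate; combining models when no single one suffices (as the paper does) is omitted for brevity.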