MODEL-BASED INTERPRETATION OF COMPLEX AND VARIABLE IMAGES

Citation
Cj. Taylor et al., MODEL-BASED INTERPRETATION OF COMPLEX AND VARIABLE IMAGES, Philosophical transactions-Royal Society of London. Biological sciences, 352(1358), 1997, pp. 1267-1274
Citations number
40
Categorie Soggetti
Biology
ISSN journal
09628436
Volume
352
Issue
1358
Year of publication
1997
Pages
1267 - 1274
Database
ISI
SICI code
0962-8436(1997)352:1358<1267:MIOCAV>2.0.ZU;2-S
Abstract
The ultimate goal of machine vision is image understanding-the ability not only to recover image structure but also to know what it represen ts. By definition, this involves the use of models which describe and label the expected structure of the world. Over the past decade, model -based vision has been applied successfully to images of man-made obje cts. It has proved much more difficult to develop model-based approach es to the interpretation of images of complex and variable structures such as faces or the internal organs of the human body (as visualized in medical images). In such cases it has been problematic even to reco ver image structure reliably without a model to organize the often noi sy and incomplete image evidence. The key problem is that of variabili ty. To be useful, a model needs to be specific-that is, to be capable of representing only 'legal' examples of the modelled object(s). It ha s proved difficult to achieve this whilst allowing for natural variabi lity. Recent developments have overcome this problem; it has been show n that specific patterns of variability in shape and grey-level appear ance can be captured by statistical models that can be used directly i n image interpretation. The details of the approach are outlined and p ractical examples from medical image interpretation and face recogniti on are used to illustrate how previously intractable problems can now be tackled successfully. It is also interesting to ask whether these r esults provide any possible insights into natural vision; for example, we show that the apparent changes in shape which result from viewing three-dimensional objects from different viewpoints can be modelled qu ite well in two dimensions; this may lend some support to the 'charact eristic views' model of natural vision.