The popularity of digital images is rapidly increasing due to improving dig
ital imaging technologies and convenient availability facilitated by the In
ternet. However, how to find user-intended images from the Internet is nont
rivial. The main reason is that the Web images are usually not annotated us
ing semantic descriptors. In this article, we present an effective approach
to and a prototype system for image retrieval from the Internet using Web
mining. The system can also serve as a Web image search engine. One of the
key ideas in the approach is to extract the text information on the Web pag
es to semantically describe the images. The text description is then combin
ed with other low-level image features in the image similarity assessment.
Another main contribution of this work is that we apply data mining on the
log of users' feedback to improve image retrieval performance in three aspe
cts. First, the accuracy of the document space model of image representatio
n obtained from the Web pages is improved by removing clutter and irrelevan
t text information. Second, to construct the user space model of users' rep
resentation of images, which is then combined with the document space model
to eliminate mismatch between the page author's expression and the user's
understanding and expectation. Third, to discover the relationship between
low-level and high-level features, which is extremely useful for assigning
the low-level features' weights in similarity assessment.