Clustering is the unsupervised classification of patterns (observations, da
ta items, or feature vectors) into groups (clusters). The clustering proble
m has been addressed in many contexts and by researchers in many discipline
s; this reflects its broad appeal and usefulness as one of the steps in exp
loratory data analysis. However, clustering is a difficult problem combinat
orially, and differences in assumptions and contexts in different communiti
es has made the transfer of useful generic concepts and methodologies slow
to occur. This paper presents an overview of pattern clustering methods fro
m a statistical pattern recognition perspective, with a goal of providing u
seful advice and references to fundamental concepts accessible to the broad
community of clustering practitioners. We present a taxonomy of clustering
techniques, and identify cross-cutting themes and recent advances. We also
describe some important applications of clustering algorithms such as imag
e segmentation, object recognition, and information retrieval.