WaveCluster: a wavelet-based clustering approach for spatial data in very large databases

Citation
G. Sheikholeslami et al., WaveCluster: a wavelet-based clustering approach for spatial data in very large databases, VLDB J, 8(3-4), 2000, pp. 289-304
Citations number
34
Categorie Soggetti
Computer Science & Engineering
Journal title
VLDB JOURNAL
ISSN journal
10668888 → ACNP
Volume
8
Issue
3-4
Year of publication
2000
Pages
289 - 304
Database
ISI
SICI code
1066-8888(200002)8:3-4<289:WAWCAF>2.0.ZU;2-W
Abstract
Many applications require the management of spatial data in a multidimensio nal feature space. Clustering large spatial databases is an important probl em, which tries to find the densely populated regions in the feature space to be used in data mining, knowledge discovery, or efficient information re trieval. A good clustering approach should be efficient and detect clusters of arbitrary shape. It must be insensitive to the noise (outliers) and the order of input data. We propose WaveCluster, a novel clustering approach b ased on wavelet transforms, which satisfies all the above requirements. Usi ng the multiresolution property of wavelet transforms, we can effectively i dentify arbitrarily shaped clusters at different degrees of detail. We also demonstrate that WaveCluster is highly efficient in terms of time complexi ty. Experimental results on very large datasets are presented, which show t he efficiency and effectiveness of the proposed approach compared to the ot her recent clustering methods.