Efficient computation of iceberg cubes with complex measures

Citation
Jw. Han et al., Efficient computation of iceberg cubes with complex measures, SIG RECORD, 30(2), 2001, pp. 1-12
Citations number
16
Categorie Soggetti
Computer Science & Engineering
Journal title
SIGMOD RECORD
ISSN journal
01635808 → ACNP
Volume
30
Issue
2
Year of publication
2001
Pages
1 - 12
Database
ISI
SICI code
0163-5808(200106)30:2<1:ECOICW>2.0.ZU;2-4
Abstract
It is often too expensive to compute and materialize a complete high-dimens ional data cube. Computing an iceberg cube, which contains only aggregates above certain thresholds, is an effective way to derive nontrivial multidim ensional aggregations for OLAP and data mining. In this paper, we study efficient methods for computing iceberg cubes with some popularly used complex measures, such as average, and develop a method ology that adopts a weaker but anti-monotonic condition for testing and pru ning search space. In particular, for efficient computation of iceberg cube s with the average measure, we propose a top-k average pruning method and e xtend two previously studied methods, Apriori and BUC, to Top-k Apriori and Top-k BUC. To further improve the performance, an interesting hypertree st ructure, called H-tree, is designed and a new iceberg cubing method, called Top-k H-Cubing, is developed. Our performance study shows that Top-k BUC a nd Top-k H-Cubing are two promising candidates for scalable computation, an d Top-k H-Cubing has better performance in most cases.