A tree projection algorithm for generation of frequent item sets

Citation
Rc. Agarwal et al., A tree projection algorithm for generation of frequent item sets, J PAR DISTR, 61(3), 2001, pp. 350-371
Citations number
17
Categorie Soggetti
Computer Science & Engineering
Journal title
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
ISSN journal
07437315 → ACNP
Volume
61
Issue
3
Year of publication
2001
Pages
350 - 371
Database
ISI
SICI code
0743-7315(200103)61:3<350:ATPAFG>2.0.ZU;2-Q
Abstract
In this paper we propose algorithms for generation of frequent item sets by successive construction of the nodes of a lexicographic tree of item sets. We discuss different strategies in generation and traversal of the lexicog raphic tree such as breadth-first search, depth-first search, or a combinat ion of the two. These techniques provide different trade-offs in terms of t he I/O, memory, and computational time requirements. We use the hierarchica l structure of the lexicographic tree to successively project transactions at each node of the lexicographic tree and use matrix counting on this redu ced set of transactions for finding frequent item sets. We tested our algor ithm on both real and synthetic data. We provide an implementation of the t ree projection method which is up to one order of magnitude faster than oth er recent techniques in the literature. The algorithm has a well-structured data access pattern which provides data locality and reuse of data for mul tiple levels of the cache. We also discuss methods for parallelization of t he TreeProjection algorithm. (C) 2001 Academic Press.