An efficient algorithm for mining association rules for large itemsets in large databases: from sequential to parallel

Citation
Aky. Wong et al., An efficient algorithm for mining association rules for large itemsets in large databases: from sequential to parallel, ENG INTEL S, 8(2), 2000, pp. 109-117
Citations number
17
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
ENGINEERING INTELLIGENT SYSTEMS FOR ELECTRICAL ENGINEERING AND COMMUNICATIONS
ISSN journal
14728915 → ACNP
Volume
8
Issue
2
Year of publication
2000
Pages
109 - 117
Database
ISI
SICI code
1472-8915(200006)8:2<109:AEAFMA>2.0.ZU;2-E
Abstract
This paper addresses three issues: (a) development of a new algorithm for m ining association rules efficiently from large databases in both sequential and distributed environments, (b) parallelization of this new algorithm to enhance performance gain by distributed parallelism, and (c) improvement o f this gain by properly steered scalability. The new algorithm is derived f rom the traditional Apriori approach by adding features to improve mining p erformance. These features include the binary encoding mechanism and its lo garithmic decoding counterpart. There is a need to parallelize the mining p rocess only when the overlay structure for the binary encoded structure for the database is too big and requires frequent time-consuming data swapping during the mining process. The scale of the parallelization, however, shou ld be tuned down gradually by scalability so that the computation-to-commun ication ratio can be kept at a reasonable level to maintain the benefit fro m parallelism.