Parallel sequence mining on shared-memory machines

Authors
Citation
Mj. Zaki, Parallel sequence mining on shared-memory machines, J PAR DISTR, 61(3), 2001, pp. 401-426
Citations number
18
Categorie Soggetti
Computer Science & Engineering
Journal title
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
ISSN journal
07437315 → ACNP
Volume
61
Issue
3
Year of publication
2001
Pages
401 - 426
Database
ISI
SICI code
0743-7315(200103)61:3<401:PSMOSM>2.0.ZU;2-V
Abstract
We present pSPADE, a parallel algorithm for fast discovery of frequent sequ ences in large databases. pSPADE decomposes the original search space into smaller suffix-based classes. Each class can be solved in main-memory using efficient search techniques and simple join operations. Furthermore, each class can be solved independently on each processor requiring no synchroniz ation. However, dynamic interclass and intraclass load balancing must be ex ploited to ensure that each processor gets an equal amount of work. Experim ents on a 12 processor SGI Origin 2000 shared memory system show good speed up and excellent scaleup results. (C) 2001 Academic Press.