Parallel sequence mining on shared-memory machines

Authors

Zaki, MJ

Citation

M.J. Zaki, Parallel sequence mining on shared-memory machines, J PAR DISTR, 61(3), 2001, pp. 401-426

Citations number

Subject categories

Computer Science & Engineering

Journal title

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING

ISSN journal

0743-7315

Volume

61

Issue

3

Year of publication

2001

Pages

401 - 426

Database

ISI

SICI code

0743-7315(200103)61:3<401:PSMOSM>2.0.ZU;2-V

Abstract

We present pSPADE, a parallel algorithm for fast discovery of frequent sequences in large databases. pSPADE decomposes the original search space into smaller suffix-based classes. Each class can be solved in main memory using efficient search techniques and simple join operations. Furthermore, each class can be solved independently on each processor, requiring no synchronization. However, dynamic interclass and intraclass load balancing must be exploited to ensure that each processor gets an equal amount of work. Experiments on a 12-processor SGI Origin 2000 shared-memory system show good speedup and excellent scaleup results. (C) 2001 Academic Press.
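
The decomposition and interclass balancing described in the abstract can be illustrated with a minimal Python sketch. This is not the pSPADE implementation: the names `partition_by_suffix`, `process_classes`, and `solve` are hypothetical, the suffix length `k` is an assumed parameter, and the per-class work is left as a caller-supplied placeholder. It only shows the general idea of grouping sequences by a shared suffix and letting idle workers pull the next class from a shared queue. Python threads are used for brevity and do not give true CPU parallelism.

```python
import queue
import threading
from collections import defaultdict


def partition_by_suffix(sequences, k=1):
    """Group sequences (tuples of items) by their length-k suffix.

    Assumption: a "class" is simply the set of sequences sharing a suffix.
    """
    classes = defaultdict(list)
    for seq in sequences:
        classes[tuple(seq[-k:])].append(seq)
    return dict(classes)


def process_classes(classes, solve, num_workers=4):
    """Dynamic interclass balancing: each idle worker pulls the next class.

    `solve(suffix, members)` is a hypothetical placeholder for the
    per-class mining step; classes are processed independently.
    """
    tasks = queue.Queue()
    # Queue larger classes first so that big tasks are not left for the end.
    for suffix, members in sorted(classes.items(), key=lambda kv: -len(kv[1])):
        tasks.put((suffix, members))

    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                suffix, members = tasks.get_nowait()
            except queue.Empty:
                return  # no classes left, worker exits
            out = solve(suffix, members)
            with lock:  # only the shared result table needs protection
                results[suffix] = out

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    seqs = [("A", "B"), ("C", "B"), ("A", "C"), ("B",)]
    by_suffix = partition_by_suffix(seqs)
    # Placeholder "solve": just count the members of each class.
    print(process_classes(by_suffix, lambda suffix, members: len(members), 2))
```

Pulling classes from a shared queue, rather than assigning them statically up front, is one simple way to keep workers busy when class sizes are uneven. The intraclass balancing mentioned in the abstract is not covered by this sketch.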