Discovery of frequent DATALOG patterns

Citation
L. Dehaspe et H. Toivonen, Discovery of frequent DATALOG patterns, DATA M K D, 3(1), 1999, pp. 7-36
Citations number
67
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
DATA MINING AND KNOWLEDGE DISCOVERY
ISSN journal
13845810 → ACNP
Volume
3
Issue
1
Year of publication
1999
Pages
7 - 36
Database
ISI
SICI code
1384-5810(199903)3:1<7:DOFDP>2.0.ZU;2-Y
Abstract
Discovery of frequent patterns has been studied in a variety of data mining settings. In its simplest form, known from association rule mining, the ta sk is to discover all frequent itemsets, i.e., all combinations of items th at are found in a sufficient number of examples. The fundamental task of as sociation rule and frequent set discovery has been extended in various dire ctions, allowing more useful patterns to be discovered with special purpose algorithms. We present WARMR, a general purpose inductive logic programmin g algorithm that addresses frequent query discovery: a very general DATALOG formulation of the frequent pattern discovery problem. The motivation for this novel approach is twofold. First, exploratory data mining isi well supported: WARMR offers the flexibility required to experim ent with standard and in particular novel settings not supported by special purpose algorithms. Also, application prototypes based on WARMR can be use d as benchmarks in the comparison and evaluation of new special purpose alg orithms. Second, the unified representation gives insight to the blurred pi cture of the frequent pattern discovery domain. Within the DATALOG formulat ion a number of dimensions appear that relink diverged settings. We demonstrate the frequent query approach and its use on two applications, one in alarm analysis, and one in a chemical toxicology domain.