A MULTISTRATEGY APPROACH TO RELATIONAL KNOWLEDGE DISCOVERY IN DATABASES

Citation
K. Morik et P. Brockhausen, A MULTISTRATEGY APPROACH TO RELATIONAL KNOWLEDGE DISCOVERY IN DATABASES, Machine learning, 27(3), 1997, pp. 287-312
Citations number
36
Categorie Soggetti
Computer Sciences","Computer Science Artificial Intelligence",Neurosciences
Journal title
ISSN journal
08856125
Volume
27
Issue
3
Year of publication
1997
Pages
287 - 312
Database
ISI
SICI code
0885-6125(1997)27:3<287:AMATRK>2.0.ZU;2-S
Abstract
When learning from very large databases, the reduction of complexity i s extremely important. Two extremes of making knowledge discovery in d atabases (KDD) feasible have been put forward. One extreme is to choos e a very simple hypothesis language, thereby being capable of very fas t learning on real-world databases. The opposite extreme is to select a small data set, thereby being able to learn very expressive (first-o rder logic) hypotheses. A multistrategy approach allows one to include most of these advantages and exclude most of the disadvantages. Simpl er learning algorithms detect hierarchies which are used to structure the hypothesis space for a more complex learning algorithm. The better structured the hypothesis space is, the better learning can prune awa y uninteresting or losing hypotheses and the faster it becomes. We hav e combined inductive logic programming (ILP) directly with a relationa l database management system. The ILP algorithm is controlled in a mod el-driven way by the user and in a data-driven way by structures that are induced by three simple learning algorithms.