The role of occam's razor in knowledge discovery

Authors
Citation
P. Domingos, The role of occam's razor in knowledge discovery, DATA M K D, 3(4), 1999, pp. 409-425
Citations number
102
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
DATA MINING AND KNOWLEDGE DISCOVERY
ISSN journal
13845810 → ACNP
Volume
3
Issue
4
Year of publication
1999
Pages
409 - 425
Database
ISI
SICI code
1384-5810(199912)3:4<409:TROORI>2.0.ZU;2-D
Abstract
Many KDD systems incorporate an implicit or explicit preference for simpler models, but this use of "Occam's razor" has been strongly criticized by se veral authors (e.g., Schaffer, 1993; Webb, 1996). This controversy arises p artly because Occam's razor has been interpreted in two quite different way s. The first interpretation (simplicity is a goal in itself) is essentially correct, but is at heart a preference for more comprehensible models. The second interpretation (simplicity leads to greater accuracy) is much more p roblematic. A critical review of the theoretical arguments for and against it shows that it is unfounded as a universal principle, and demonstrably fa lse. A review of empirical evidence shows that it also fails as a practical heuristic. This article argues that its continued use in KDD risks causing significant opportunities to be missed, and should therefore be restricted to the comparatively few applications where it is appropriate. The article proposes and reviews the use of domain constraints as an alternative for a voiding overfitting, and examines possible methods for handling the accurac y-comprehensibility trade-off.