A STATISTICAL PERSPECTIVE ON DATA MINING

Citation
Jrm. Hosking et al., A STATISTICAL PERSPECTIVE ON DATA MINING, Future generations computer systems, 13(2-3), 1997, pp. 117-134
Citations number
30
ISSN journal
0167739X
Volume
13
Issue
2-3
Year of publication
1997
Pages
117 - 134
Database
ISI
SICI code
0167-739X(1997)13:2-3<117:ASPODM>2.0.ZU;2-B
Abstract
Data mining can be regarded as a collection of methods for drawing inf erences from data. The aims of data mining, and some of its methods, o verlap with those of classical statistics. However, there are some phi losophical and methodological differences. We examine these difference s, and we describe three approaches to machine learning that have deve loped largely independently: classical statistics, Vapnik's statistica l learning theory, and computational learning theory. Comparing these approaches, we conclude that statisticians and data miners can profit by studying each other's methods and using a judiciously chosen combin ation of them.