STATISTICAL EVALUATION OF ROUGH SET DEPENDENCY ANALYSIS

Citation
I. Duntsch et G. Gediga, STATISTICAL EVALUATION OF ROUGH SET DEPENDENCY ANALYSIS, International journal of human-computer studies, 46(5), 1997, pp. 589-604
Citations number
17
Categorie Soggetti
Psychology,Ergonomics,"Computer Sciences","Controlo Theory & Cybernetics","Computer Science Cybernetics
ISSN journal
10715819
Volume
46
Issue
5
Year of publication
1997
Pages
589 - 604
Database
ISI
SICI code
1071-5819(1997)46:5<589:SEORSD>2.0.ZU;2-S
Abstract
Rough set data analysis (RSDA) has recently become a frequently studie d symbolic method in data mining. Among other things, it is being used for the extraction of rules from databases; it is, however, not clear from within the methods of rough set analysis, whether the extracted rules are valid. In this paper, we suggest to enhance RSDA by two simp le statistical procedures, both based on randomization techniques, to evaluate the validity of prediction based on the approximation quality of attributes of rough set dependency analysis. The first procedure t ests the casualness of a prediction to ensure that the prediction is n ot based on only a few (casual) observations. The second procedure tes ts the conditional casualness of an attribute within a prediction rule . The procedures are applied to three data sets, originally published in the context of rough set analysis. We argue that several claims of these analyses need to be modified because of lacking validity, and th at other possibly significant results were overlooked. (C) 1997 Academ ic Press Limited.