I. Duntsch et G. Gediga, STATISTICAL EVALUATION OF ROUGH SET DEPENDENCY ANALYSIS, International journal of human-computer studies, 46(5), 1997, pp. 589-604
Citations number
17
Categorie Soggetti
Psychology,Ergonomics,"Computer Sciences","Controlo Theory & Cybernetics","Computer Science Cybernetics
Rough set data analysis (RSDA) has recently become a frequently studie
d symbolic method in data mining. Among other things, it is being used
for the extraction of rules from databases; it is, however, not clear
from within the methods of rough set analysis, whether the extracted
rules are valid. In this paper, we suggest to enhance RSDA by two simp
le statistical procedures, both based on randomization techniques, to
evaluate the validity of prediction based on the approximation quality
of attributes of rough set dependency analysis. The first procedure t
ests the casualness of a prediction to ensure that the prediction is n
ot based on only a few (casual) observations. The second procedure tes
ts the conditional casualness of an attribute within a prediction rule
. The procedures are applied to three data sets, originally published
in the context of rough set analysis. We argue that several claims of
these analyses need to be modified because of lacking validity, and th
at other possibly significant results were overlooked. (C) 1997 Academ
ic Press Limited.