ITA
ENG

Comparing case-based reasoning classifiers for predicting high risk software components

Authors

El Emam, K Benlarbi, S Goel, N Rai, SN

Citation

K. El Emam et al., Comparing case-based reasoning classifiers for predicting high risk software components, J SYST SOFT, 55(3), 2001, pp. 301-320

Citations number

Categorie Soggetti

Computer Science & Engineering

Journal title

JOURNAL OF SYSTEMS AND SOFTWARE

ISSN journal

01641212 → ACNP

Volume

Issue

Year of publication

2001

Pages

301 - 320

Database

ISI

SICI code

0164-1212(20010115)55:3<301:CCRCFP>2.0.ZU;2-4

Abstract

Case-based reasoning (CBR) has been proposed for predicting the risk class of software components. Risky components can be defined as those that are f ault-prone, or those that require a large amount of effort to maintain. Thu s far evaluative studies of CBR classifiers have been promising, showing th at their predictive performance is as good as or better than other types of classifiers. However, a CBR classifier can be instantiated in different wa ys by varying its parameters, and it is not clear which combination of para meters provides the best performance. In this paper we evaluate the perform ance of a CBR classifier with different parameters, namely: (a) different d istance measures, (b) different standardization techniques, (c) use or non- use of weights, and (d) the number of nearest neighbors to use for the pred iction. In total, we compared 30 different CBR classifiers. The study was c onducted with a data set from a large real-time system, and the objective w as to predict the fault-proneness of its components. Our results indicate t hat there is no difference in prediction performance when using any combina tion of parameters. Based on these results, we recommend using a simple CBR classifier with Euclidean distance, z-score standardization, no weighting scheme, and selecting the single nearest neighbor for prediction. The advan tage of such a classifier is its intuitive appeal to nonspecialists, and th e fact that it performs as well as more complex classifiers. (C) 2001 Elsev ier Science Inc. All rights reserved.