ITA
ENG

Optimal techniques for class-dependent attribute discretization

Authors

Bryson, N Joseph, A

Citation

N. Bryson et A. Joseph, Optimal techniques for class-dependent attribute discretization, J OPER RES, 52(10), 2001, pp. 1130-1143

Citations number

Categorie Soggetti

Management,"Engineering Mathematics

Journal title

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY

ISSN journal

01605682 → ACNP

Volume

Issue

Year of publication

2001

Pages

1130 - 1143

Database

ISI

SICI code

0160-5682(200110)52:10<1130:OTFCAD>2.0.ZU;2-I

Abstract

Preprocessing of raw data has been shown to improve performance of knowledg e discovery processes. Discretization of quantitative attributes is a key c omponent of preprocessing and has the potential to greatly impact the effic iency of the process and the quality of its outcomes. In attribute discreti zation, the value domain of an attribute is partitioned into a finite set o f intervals so that the attribute can be described using a small number of discrete representations. Discretization therefore involves two decisions, on the number of intervals and the placement of interval boundaries. Previo us approaches for quantitative attribute discretization have used heuristic algorithms to identify partitions of the attribute value domain. Therefore , these approaches cannot be guaranteed to provide the optimal solution for the given discretization criterion and number of intervals. In this paper, we use linear programming (LP) methods to formulate the attribute discreti zation problem. The LP formulation allows the discretization criterion and the number of intervals to be integral considerations of the problem. We co nduct experiments and identify optimal solutions for various discretization criteria and numbers of intervals.