AN EMPIRICAL-COMPARISON OF EXPERT-DERIVED AND DATA-DERIVED CLASSIFICATION TREES

Citation
M. Chiogna et al., AN EMPIRICAL-COMPARISON OF EXPERT-DERIVED AND DATA-DERIVED CLASSIFICATION TREES, Statistics in medicine, 15(2), 1996, pp. 157-169
Citations number
21
Categorie Soggetti
Statistic & Probability","Medicine, Research & Experimental","Public, Environmental & Occupation Heath","Statistic & Probability","Medical Informatics
Journal title
ISSN journal
02776715
Volume
15
Issue
2
Year of publication
1996
Pages
157 - 169
Database
ISI
SICI code
0277-6715(1996)15:2<157:AEOEAD>2.0.ZU;2-2
Abstract
Classification trees provide an attractively transparent discriminatio n technique, and may be derived from both expert opinion and from data analysis, We consider a real and complex problem concerning the diagn osis of babies with suspected critical congenital heart disease into o ne of 27 classes, A full loss matrix for all possible misclassificatio ns was obtained from clinical assessments. A tree derived from expert opinion was compared with those derived from analysis of 571 past case s, both for the full problem and for a subset of 6 diseases. Automatic methods for tree creation and pruning were found to have problems for rare diseases, and hand-pruning was carried out. Inclusion of costs l ed to much improved clinical performance, even for trees that had orig inally been constructed to minimize classification errors. The expert tree showed a specific building strategy that could not be reproduced automatically. The expert tree generally outperformed those derived fr om data, particularly in the ability to identify important composite f eatures.