ITA
ENG

Assessing the accuracy of prediction algorithms for classification: an overview

Authors

Baldi, P Brunak, S Chauvin, Y Andersen, CAF Nielsen, H

Citation

P. Baldi et al., Assessing the accuracy of prediction algorithms for classification: an overview, BIOINFORMAT, 16(5), 2000, pp. 412-424

Citations number

Categorie Soggetti

Multidisciplinary

Journal title

BIOINFORMATICS

ISSN journal

13674803 → ACNP

Volume

Issue

Year of publication

2000

Pages

412 - 424

Database

ISI

SICI code

1367-4803(200005)16:5<412:ATAOPA>2.0.ZU;2-R

Abstract

We provide a unified overview of methods that currently are widely used to assess the accuracy of prediction algorithms, from raw percentages, quadrat ic error measures and other distances, ann correlation coefficients, and to information theoretic measures such as relative entropy and mutual informa tion. We briefly discuss the advantages and disadvantages of each approach. For classification tasks, we derive new learning algorithms for the design of prediction systems by directly optimising the correlation coefficient. We observe and prove several results relating sensitivity nod specificity o f optimal systems. While the principles are general, we illustrate the appl icability on specific problems such as protein secondary structure and sign al peptide prediction.