ITA
ENG

Automatic scoring of pronunciation quality

Authors

Neumeyer, L Franco, H Digalakis, V Weintraub, M

Citation

L. Neumeyer et al., Automatic scoring of pronunciation quality, SPEECH COMM, 30(2-3), 2000, pp. 83-93

Citations number

Categorie Soggetti

Computer Science & Engineering

Journal title

SPEECH COMMUNICATION

ISSN journal

01676393 → ACNP

Volume

Issue

2-3

Year of publication

2000

Pages

83 - 93

Database

ISI

SICI code

0167-6393(200002)30:2-3<83:ASOPQ>2.0.ZU;2-9

Abstract

We present a paradigm for the automatic assessment of pronunciation quality by machine. In this scoring paradigm, both native and nonnative speech dat a is collected and a database of human-expert ratings is created to enable the development of a variety of machine scores. We first discuss issues rel ated to the design of speech databases and the reliability of human ratings . We then address pronunciation evaluation as a prediction problem, trying to predict the grade a human expert would assign to a particular skill. Usi ng the speech and the expert-ratings databases, we build statistical models and introduce different machine scores that can be used as predictor varia bles. We validate these machine scores on the Voice Interactive Language Tr aining System (VILTS) corpus, evaluating the pronunciation of American spea kers speaking French and we show that certain machine scores, like the log- posterior and the normalized duration, achieve a correlation with the targe ted human grades that is comparable to the human-to-human correlation when a sufficient amount of speech data is available. (C) 2000 Elsevier Science B.V. All rights reserved.