ITA
ENG

MOS AND PAIR COMPARISON COMBINED METHODS FOR QUALITY EVALUATION OF TEXT-TO-SPEECH SYSTEMS

Authors

SALZA PL FOTI E NEBBIA L OREGLIA M

Citation

Pl. Salza et al., MOS AND PAIR COMPARISON COMBINED METHODS FOR QUALITY EVALUATION OF TEXT-TO-SPEECH SYSTEMS, Acustica, 82(4), 1996, pp. 650-656

Citations number

Categorie Soggetti

Acoustics

Journal title

Acustica → ACNP

ISSN journal

14367947

Volume

Issue

Year of publication

1996

Pages

650 - 656

Database

ISI

SICI code

1436-7947(1996)82:4<650:MAPCCM>2.0.ZU;2-4

Abstract

The overall quality of three Text-To-Speech (TTS) synthesis systems fo r Italian with common prosodic control but different diphones and synt hesizers was evaluated by means of the combined application of Mean Op inion Score and Pair Comparison methods. Direct comparison between the two methods serves to validate MOS, which is the the technique recomm ended by CCITT for synthesis evaluation. In the MOS experiment, assess ment also included three types of natural speech (normal and degraded) as reference. Eighteen subjects expressed 2880 MOS judgements and mad e 720 comparisons in all. The results obtained from the two methods sh owed good agreement. The most important MOS voice parameters used by l isteners for differentiating the systems were Global Impression, Voice , Articulation and Pronunciation. The diphones appeared to contribute most to the different judgements, whereas synthesizers were not percei ved as different by listeners. This experiment provides positive verif ication of interlaboratory reproducibility of MOS, which proved to be an effective technique for overall assessment of TTS quality.