MOS AND PAIR COMPARISON COMBINED METHODS FOR QUALITY EVALUATION OF TEXT-TO-SPEECH SYSTEMS

Citation
Pl. Salza et al., MOS AND PAIR COMPARISON COMBINED METHODS FOR QUALITY EVALUATION OF TEXT-TO-SPEECH SYSTEMS, Acustica, 82(4), 1996, pp. 650-656
Citations number
14
Categorie Soggetti
Acoustics
Journal title
ISSN journal
14367947
Volume
82
Issue
4
Year of publication
1996
Pages
650 - 656
Database
ISI
SICI code
1436-7947(1996)82:4<650:MAPCCM>2.0.ZU;2-4
Abstract
The overall quality of three Text-To-Speech (TTS) synthesis systems fo r Italian with common prosodic control but different diphones and synt hesizers was evaluated by means of the combined application of Mean Op inion Score and Pair Comparison methods. Direct comparison between the two methods serves to validate MOS, which is the the technique recomm ended by CCITT for synthesis evaluation. In the MOS experiment, assess ment also included three types of natural speech (normal and degraded) as reference. Eighteen subjects expressed 2880 MOS judgements and mad e 720 comparisons in all. The results obtained from the two methods sh owed good agreement. The most important MOS voice parameters used by l isteners for differentiating the systems were Global Impression, Voice , Articulation and Pronunciation. The diphones appeared to contribute most to the different judgements, whereas synthesizers were not percei ved as different by listeners. This experiment provides positive verif ication of interlaboratory reproducibility of MOS, which proved to be an effective technique for overall assessment of TTS quality.