MBR-PSOLA - TEXT-TO-SPEECH SYNTHESIS BASED ON AN MBE RE-SYNTHESIS OF THE SEGMENTS DATABASE

Authors
Citation
T. Dutoit et H. Leich, MBR-PSOLA - TEXT-TO-SPEECH SYNTHESIS BASED ON AN MBE RE-SYNTHESIS OF THE SEGMENTS DATABASE, Speech communication, 13(3-4), 1993, pp. 435-440
Citations number
11
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
13
Issue
3-4
Year of publication
1993
Pages
435 - 440
Database
ISI
SICI code
0167-6393(1993)13:3-4<435:M-TSBO>2.0.ZU;2-S
Abstract
The use of the Time-Domain Pitch Synchronous OverLap-Add (TD-PSOLA) al gorithm in a Text-To-Speech synthesizer is reviewed. Its drawbacks are underlined and three conditions on the speech database are examined. In order to satisfy them, a previously described high quality resynthe sis process is developed and enhanced, which makes use of the well-kno wn Multi-Band Excited (MBE) model. An important by product of this ope ration is that optimal Pitch Marking turns out to be automatic. A temp oral interpolation block is finally added. The resulting Multi-Band Re synthesis Pitch Synchronous OverLap Add (MBR-PSOLA) synthesis algorith m supports spectral interpolation between voiced parts of segments, wi th virtually no increase in complexity. It provides the basis of a hig h-quality Text-To-Speech (TTS) synthesizer.