PHYSIOLOGICALLY MOTIVATED MODELING OF THE VOICE SOURCE IN ARTICULATORY ANALYSIS SYNTHESIS

Citation
B. Cranen et J. Schroeter, PHYSIOLOGICALLY MOTIVATED MODELING OF THE VOICE SOURCE IN ARTICULATORY ANALYSIS SYNTHESIS, Speech communication, 19(1), 1996, pp. 1-19
Citations number
15
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
19
Issue
1
Year of publication
1996
Pages
1 - 19
Database
ISI
SICI code
0167-6393(1996)19:1<1:PMMOTV>2.0.ZU;2-F
Abstract
This paper describes the implementation of a new parametric model of t he glottal geometry aimed at improving male and female speech synthesi s in the framework of articulatory analysis synthesis. The model repre sents glottal geometry in terms of inlet and outlet area waveforms and is controlled by parameters that are tightly coupled to physiology, s uch as vocal fold abduction. It is embedded in an articulatory analysi s synthesis system (articulatory speech mimic). To introduce naturally occurring details in our synthetic glottal flow waveforms, we modelle d two different kinds of leakage: a ''linked leak'' and a ''parallel c hink''. While the first is basically an incomplete glottal closure, th e latter models a second glottal duct that is independent of the membr anous (vibrating) part of the glottis. Characteristic for both types o f leaks is that they increase de-flow and source/tract interaction. A linked leak, however, gives rise to a steeper roll-off of the entire g lottal flow spectrum, whereas a parallel chink decreases the energy of the lower frequencies more than the higher frequencies. In fact, for a parallel chink, the slope at the higher freqencies is more or less t he same as in the no-leakage case.