ITA
ENG

PHYSIOLOGICALLY MOTIVATED MODELING OF THE VOICE SOURCE IN ARTICULATORY ANALYSIS SYNTHESIS

Authors

CRANEN B SCHROETER J

Citation

B. Cranen et J. Schroeter, PHYSIOLOGICALLY MOTIVATED MODELING OF THE VOICE SOURCE IN ARTICULATORY ANALYSIS SYNTHESIS, Speech communication, 19(1), 1996, pp. 1-19

Citations number

Categorie Soggetti

Communication,"Language & Linguistics

Journal title

Speech communication → ACNP

ISSN journal

01676393

Volume

Issue

Year of publication

1996

Pages

1 - 19

Database

ISI

SICI code

0167-6393(1996)19:1<1:PMMOTV>2.0.ZU;2-F

Abstract

This paper describes the implementation of a new parametric model of t he glottal geometry aimed at improving male and female speech synthesi s in the framework of articulatory analysis synthesis. The model repre sents glottal geometry in terms of inlet and outlet area waveforms and is controlled by parameters that are tightly coupled to physiology, s uch as vocal fold abduction. It is embedded in an articulatory analysi s synthesis system (articulatory speech mimic). To introduce naturally occurring details in our synthetic glottal flow waveforms, we modelle d two different kinds of leakage: a ''linked leak'' and a ''parallel c hink''. While the first is basically an incomplete glottal closure, th e latter models a second glottal duct that is independent of the membr anous (vibrating) part of the glottis. Characteristic for both types o f leaks is that they increase de-flow and source/tract interaction. A linked leak, however, gives rise to a steeper roll-off of the entire g lottal flow spectrum, whereas a parallel chink decreases the energy of the lower frequencies more than the higher frequencies. In fact, for a parallel chink, the slope at the higher freqencies is more or less t he same as in the no-leakage case.