SPEECH FORMANT FREQUENCY AND BANDWIDTH TRACKING USING MULTIBAND ENERGY DEMODULATION

Citation
A. Potamianos et P. Maragos, SPEECH FORMANT FREQUENCY AND BANDWIDTH TRACKING USING MULTIBAND ENERGY DEMODULATION, The Journal of the Acoustical Society of America, 99(6), 1996, pp. 3795-3806
Citations number
27
Categorie Soggetti
Acoustics
ISSN journal
00014966
Volume
99
Issue
6
Year of publication
1996
Pages
3795 - 3806
Database
ISI
SICI code
0001-4966(1996)99:6<3795:SFFABT>2.0.ZU;2-E
Abstract
In this paper, the amplitude and frequency (AM-FM) modulation model an d a multiband demodulation analysis scheme are applied to formant freq uency and bandwidth tracking of speech signals. Filtering by a bank of Gabor bandpass filters is performed to isolate each speech resonance in the signal. Next, the amplitude envelope (AM) and instantaneous fre quency (FM) are estimated for each band using the energy separation al gorithm (ESA). Short-time formant frequency and bandwidth estimates ar e obtained from the instantaneous amplitude and frequency signals; two frequency estimates are proposed and their relative merits are discus sed. The short-time estimates are used to compute the formant location s and bandwidths. Performance and computational issues of the algorith m are discussed. Overall, multiband demodulation analysis (MDA) is sho wn to be a useful tool for extracting information from the speech reso nances in the time-frequency plane. (C) 1996 Acoustical Society of Ame rica.