ITA
ENG

Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model

Authors

Goh, Z Tan, KC Tan, BTG

Citation

Z. Goh et al., Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model, IEEE SPEECH, 7(5), 1999, pp. 510-524

Citations number

Categorie Soggetti

Eletrical & Eletronics Engineeing

Journal title

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

ISSN journal

10636676 → ACNP

Volume

Issue

Year of publication

1999

Pages

510 - 524

Database

ISI

SICI code

1063-6676(199909)7:5<510:KSEMBO>2.0.ZU;2-K

Abstract

In this work, we are concerned with optimal estimation of clean speech from its noisy version based on a speech model we propose. We first propose a ( single) speech model which satisfactorily describes voiced and unvoiced spe ech and silence (i.e., pauses between speech utterances), and also allows f or exploitation of the long term characteristics of noise. We then reformul ate the model equations so as to facilitate subsequent application of the w ell-established Kalman filter for computing the optimal estimate of the cle an speech in the minimum-mean-square-error sense. Since the standard algori thm for Kalman filtering involves multiplications of very large matrices an d thus demands high computational cost, we devise a mathematically equivale nt algorithm which is computationally much more efficient, by exploiting th e sparsity of the matrices concerned. Next, we present the methods me use f or estimating the model parameters and give a complete description of the e nhancement process. Performance assessment based on spectrogram plots, obje ctive measures and informal subjective listening tests all indicate that ou r method gives consistently good results. As far as signal-to-noise ratio i s concerned, the improvements over existing methods can be as high as 4 dB.