X. Menendez-pidal et al., Compensation of channel and noise distortions combining normalization and speech enhancement techniques, SPEECH COMM, 34(1-2), 2001, pp. 115-126
This paper introduces two techniques to obtain robust speech recognition de
vices in mismatch conditions (additive noise mismatch and channel mismatch)
. The first algorithm, adaptive Gaussian attenuation algorithm (AGA), is a
speech enhancement technique developed to reduce the effects of additive ba
ckground noise in a wide range of signal noise ratio (SNR) and noise condit
ions. The technique is closely related to the classical noise spectral subt
raction (SS) scheme, but in the proposed model the mean and variance of noi
se are used to better attenuate the noise. Information of the SNR is also i
ntroduced to provide adaptability at different SNR conditions. The second a
lgorithm, cepstral mean normalization and variance-scaling technique (CMNVS
), is an extension of the cepstral mean normalization (CMN) technique to pr
ovide robust features to convolutive and additive noise distortions. The re
quirements of the techniques are also analyzed in the paper. Combining both
techniques the relative channel distortion effects were reduced to 90% on
the HTIMIT task and the relative additive noise effects were reduced to 77%
using the TIMIT database mixed with car noises at different SNR conditions
. (C) 2001 Elsevier Science B.V. All rights reserved.