NOISE ROBUST SPEECH RECOGNITION USING SUBBAND-CROSS-CORRELATION ANALYSIS

Citation
S. Kajita et al., NOISE ROBUST SPEECH RECOGNITION USING SUBBAND-CROSS-CORRELATION ANALYSIS, IEICE transactions on information and systems, E81D(10), 1998, pp. 1079-1086
Citations number
12
Categorie Soggetti
Computer Science Information Systems
ISSN journal
09168532
Volume
E81D
Issue
10
Year of publication
1998
Pages
1079 - 1086
Database
ISI
SICI code
0916-8532(1998)E81D:10<1079:NRSRUS>2.0.ZU;2-7
Abstract
This paper describes subband-crosscorrelation analysis (SBXCOR) using two input channel signals. SBXCOR is an extended signal processing tec hnique of subband-autocorrelation analysis (SBCOR) that extracts perio dicities associated with the inverse of center frequencies present in speech signals. In addition, to extract more periodicity information a ssociated with the inverse of center frequencies, the multi-delay weig hting (MDW) processing is applied to SBXCOR. In experiments, the noise robustness of SBXCOR is evaluated using a DTW word recognizer under ( 1) a simulated acoustic condition with white noise and (2) a real acou stic condition in a sound proof room with human speech-like noise. As the results, under the simulated acoustic condition, it is shown that SBXCOR is more robust than the conventional one-channel SBCOR, but les s robust than SBCOR extracted From the two-channel-summed signal. Furt hermore, by applying MDW processing, the performance of SBXCOR improve d about 2% at SNR 0 dB. The resultant performance of SBXCOR with MDW p rocessing was much better than those of smoothed group delay spectrum (SGDS) and mel-filterbank cepstral coefficient (MFCC) below SNR 10 dB. The results under the real acoustic condition were almost the same as the simulated acoustic condition.