BAYESIAN CHANNEL EQUALIZATION AND ROBUST FEATURES FOR SPEECH RECOGNITION

Citation
Bp. Milner et Sv. Vaseghi, BAYESIAN CHANNEL EQUALIZATION AND ROBUST FEATURES FOR SPEECH RECOGNITION, IEE proceedings. Vision, image and signal processing, 143(4), 1996, pp. 223-231
Citations number
17
Categorie Soggetti
Engineering, Eletrical & Electronic
ISSN journal
1350245X
Volume
143
Issue
4
Year of publication
1996
Pages
223 - 231
Database
ISI
SICI code
1350-245X(1996)143:4<223:BCEARF>2.0.ZU;2-M
Abstract
The use of a speech recognition system with telephone channel environm ents, or different microphones, requires channel equalisation. In spee ch recognition, the speech model provides a bank of statistical inform ation that can be used in the channel identification and equalisation process. The authors consider HMM-based channel equalisation, and pres ent results demonstrating that substantial improvement can be obtained through the equalisation process. An alternative method, for speech r ecognition, is to use a feature set which is more robust to channel di stortion. Channel distortions result in an amplutude tilt of the speec h cepstrum, and therefore differential cepstral features provide a mea sure of immunity to channel distortions. In particular the cepstral-ti me feature matrix, in addition to providing a framework for representi ng speech dynamics, call be made robust to channel distortions. The au thors present results demonstrating that a major advantage of cepstral -time matrices is their channel insensitive character.