ITA
ENG

SIGNAL BIAS REMOVAL BY MAXIMUM-LIKELIHOOD-ESTIMATION FOR ROBUST TELEPHONE SPEECH RECOGNITION

Authors

RAHIM MG JUANG BH

Citation

Mg. Rahim et Bh. Juang, SIGNAL BIAS REMOVAL BY MAXIMUM-LIKELIHOOD-ESTIMATION FOR ROBUST TELEPHONE SPEECH RECOGNITION, IEEE transactions on speech and audio processing, 4(1), 1996, pp. 19-30

Citations number

Categorie Soggetti

Engineering, Eletrical & Electronic",Acoustics

Journal title

IEEE transactions on speech and audio processing → ACNP

ISSN journal

10636676

Volume

Issue

Year of publication

1996

Pages

19 - 30

Database

ISI

SICI code

1063-6676(1996)4:1<19:SBRBMF>2.0.ZU;2-K

Abstract

An acoustical mismatch between the training and the testing conditions of hidden Markov model (HMM)-based speech recognition systems often c auses a severe degradation in the recognition performance. In telephon e speech recognition, for example, undesirable signal components due t o ambient noise and channel distortion, as well as due to different va riations of telephone handsets render the recognizer unusable for real -world applications, This paper presents a signal bias removal (SBR) m ethod based on maximum likelihood estimation for the minimization of t hese undesirable effects, The proposed method is readily applicable in various architectures, i.e., discrete (vector-quantization based), se micontinuous and continuous density HMM, In this paper, the SBR method , integrated into a discrete density HMM, is applied to telephone spee ch recognition where the contamination due to extraneous signal compon ents is assumed to be unknown, To enable real-time implementation, a s equential method for the estimation of the bias is presented, Experime ntal results for speaker-independent connected digit recognition show a reduction in the per digit error rate by up to 41% and 14% during mi smatched and matched training and testing conditions, respectively.