LINEAR TRAJECTORY MODELS INCORPORATING PREPROCESSING PARAMETERS FOR SPEECH RECOGNITION

Citation
R. Chengalvarayan, LINEAR TRAJECTORY MODELS INCORPORATING PREPROCESSING PARAMETERS FOR SPEECH RECOGNITION, IEEE signal processing letters, 5(3), 1998, pp. 66-68
Citations number
9
Categorie Soggetti
Engineering, Eletrical & Electronic
ISSN journal
10709908
Volume
5
Issue
3
Year of publication
1998
Pages
66 - 68
Database
ISI
SICI code
1070-9908(1998)5:3<66:LTMIPP>2.0.ZU;2-N
Abstract
In this letter, we investigate the interactions of front-end feature e xtraction and back-end classification techniques in nonstationary stat e hidden Markov model (NSHMM) based speech recognition. The proposed m odel aims at finding an optimal linear transformation on the mel-warpe d discrete Fourier tranform (DFT) features according to the minimum cl assification error (MCE) criterion. This linear transformation, along with the NSHMM parameters, are automatically trained using the gradien t descent method. An error rate reduction of 8% is obtained on a stand ard 39-class TIMIT phone classification task in comparison with the MC E-trained NSHMM using conventional preprocessing techniques.