Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition

Citation
R. Vergin et al., Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition, IEEE SPEECH, 7(5), 1999, pp. 525-532
Citations number
15
Categorie Soggetti
Eletrical & Eletronics Engineeing
Journal title
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
ISSN journal
10636676 → ACNP
Volume
7
Issue
5
Year of publication
1999
Pages
525 - 532
Database
ISI
SICI code
1063-6676(199909)7:5<525:GMFCCF>2.0.ZU;2-F
Abstract
The focus of a continuous speech recognition process is to match an input s ignal with a set of words or sentences according to some optimality criteri a. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserv ing virtually all of the speech signal information dealing with the text me ssage. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCC's), and sugges ts a new parameterization approach taking into account the whole energy zon e in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-indepe ndent continuous-speech recognition system.