FROM HMMS TO SEGMENT MODELS - A UNIFIED VIEW OF STOCHASTIC MODELING FOR SPEECH RECOGNITION

Citation
M. Ostendorf et al., FROM HMMS TO SEGMENT MODELS - A UNIFIED VIEW OF STOCHASTIC MODELING FOR SPEECH RECOGNITION, IEEE transactions on speech and audio processing, 4(5), 1996, pp. 360-378
Citations number
97
Categorie Soggetti
Engineering, Eletrical & Electronic",Acoustics
ISSN journal
10636676
Volume
4
Issue
5
Year of publication
1996
Pages
360 - 378
Database
ISI
SICI code
1063-6676(1996)4:5<360:FHTSM->2.0.ZU;2-H
Abstract
In recent years, many alternative models have been proposed to address some of the shortcomings of the hidden Markov model (HMM), which is c urrently the most popular approach to speech recognition, In particula r, a variety of models that could be broadly classified as segment mod els have been described for representing a variable-length sequence of observation vectors in speech recognition applications, Since there a re many aspects in common between these approaches, including the gene ral recognition and training problems, it is useful to consider them i n a unified framework. Thus, the goal of this paper will he to describ e a general stochastic model that encompasses most of the models propo sed in the literature, pointing out similarities of the models in term s of correlation and parameter tying assumptions, and drawing analogie s between segment models and HMM's. In addition, we summarize experime ntal results assessing different modeling assumptions and point out re maining open questions.