ITA
ENG

FROM HMMS TO SEGMENT MODELS - A UNIFIED VIEW OF STOCHASTIC MODELING FOR SPEECH RECOGNITION

Authors

OSTENDORF M DIGALAKIS VV KIMBALL OA

Citation

M. Ostendorf et al., FROM HMMS TO SEGMENT MODELS - A UNIFIED VIEW OF STOCHASTIC MODELING FOR SPEECH RECOGNITION, IEEE transactions on speech and audio processing, 4(5), 1996, pp. 360-378

Citations number

Categorie Soggetti

Engineering, Eletrical & Electronic",Acoustics

Journal title

IEEE transactions on speech and audio processing → ACNP

ISSN journal

10636676

Volume

Issue

Year of publication

1996

Pages

360 - 378

Database

ISI

SICI code

1063-6676(1996)4:5<360:FHTSM->2.0.ZU;2-H

Abstract

In recent years, many alternative models have been proposed to address some of the shortcomings of the hidden Markov model (HMM), which is c urrently the most popular approach to speech recognition, In particula r, a variety of models that could be broadly classified as segment mod els have been described for representing a variable-length sequence of observation vectors in speech recognition applications, Since there a re many aspects in common between these approaches, including the gene ral recognition and training problems, it is useful to consider them i n a unified framework. Thus, the goal of this paper will he to describ e a general stochastic model that encompasses most of the models propo sed in the literature, pointing out similarities of the models in term s of correlation and parameter tying assumptions, and drawing analogie s between segment models and HMM's. In addition, we summarize experime ntal results assessing different modeling assumptions and point out re maining open questions.