AUTOMATIC-DETERMINATION OF THE NUMBER OF MIXTURE COMPONENTS FOR CONTINUOUS HMMS BASED ON A UNIFORM VARIANCE CRITERION

Citation
T. Kosaka et S. Sagayama, AUTOMATIC-DETERMINATION OF THE NUMBER OF MIXTURE COMPONENTS FOR CONTINUOUS HMMS BASED ON A UNIFORM VARIANCE CRITERION, IEICE transactions on information and systems, E78D(6), 1995, pp. 642-647
Citations number
NO
Categorie Soggetti
Computer Science Information Systems
ISSN journal
09168532
Volume
E78D
Issue
6
Year of publication
1995
Pages
642 - 647
Database
ISI
SICI code
0916-8532(1995)E78D:6<642:AOTNOM>2.0.ZU;2-Q
Abstract
We discuss how to determine automatically the number of mixture compon ents in continuous mixture density HMMs (CHMMs). A notable trend has b een the use of CHMMs in recent years. One of the major problems with a CHMM is how to determine its structure, that is, how many mixture com ponents and states it has and its optimal topology. The number of mixt ure components has been determined heuristically so far. To solve this problem, we first investigate the influence of the number of mixture components on model parameters and the output log likelihood value. As a result, in contrast to the ''mixture number uniformity'' which is a pplied in conventional approaches to determine the number of mixture c omponents, we propose the principle of ''distribution size uniformity' '. An algorithm is introduced for automatically determining the number of mixture components. The performance of this algorithm is shown thr ough recognition experiments involving all Japanese phonemes. Two type s of experiments are carried out. One assumes that the number of mixtu re components for each state is the same within a phonetic model but m ay vary between states belonging to different phonemes. The other assu mes that each state has a variable number of mixture components. These two experiments give better results than the conventional method.