SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION

Citation
K. Yamaguchi et al., SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION, IEICE transactions on information and systems, E78D(6), 1995, pp. 719-724
Citations number
NO
Categorie Soggetti
Computer Science Information Systems
ISSN journal
09168532
Volume
E78D
Issue
6
Year of publication
1995
Pages
719 - 724
Database
ISI
SICI code
0916-8532(1995)E78D:6<719:SPFSCS>2.0.ZU;2-#
Abstract
This paper describes a novel speaker-independent speech recognition me thod, called ''speaker-consistent parsing'', which is based on an intr a-speaker correlation called the speaker-consistency principle. We foc us on the fact that a sentence or a string of words is uttered by an i ndividual speaker even in a speaker-independent task. Thus, the propos ed method searches through speaker variations in addition to the conte nts of utterances. As a result of the recognition process, an appropri ate standard speaker is selected for speaker adaptation. This new meth od is experimentally compared with a conventional speaker-independent speech recognition method. Since the speaker-consistency principle bes t demonstrates its effect with a large number of training and test spe akers, a small-scale experiment may not fully exploit this principle. Nevertheless, even the results of our small-scale experiment show that the new method significantly outperforms the conventional method. In addition, this framework's speaker selection mechanism can drastically reduce the likelihood map computation.