ITA
ENG

SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION

Authors

YAMAGUCHI K SINGER H MATSUNAGA S SAGAYAMA S

Citation

K. Yamaguchi et al., SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION, IEICE transactions on information and systems, E78D(6), 1995, pp. 719-724

Citations number

Categorie Soggetti

Computer Science Information Systems

Journal title

IEICE transactions on information and systems → ACNP

ISSN journal

09168532

Volume

E78D

Issue

Year of publication

1995

Pages

719 - 724

Database

ISI

SICI code

0916-8532(1995)E78D:6<719:SPFSCS>2.0.ZU;2-#

Abstract

This paper describes a novel speaker-independent speech recognition me thod, called ''speaker-consistent parsing'', which is based on an intr a-speaker correlation called the speaker-consistency principle. We foc us on the fact that a sentence or a string of words is uttered by an i ndividual speaker even in a speaker-independent task. Thus, the propos ed method searches through speaker variations in addition to the conte nts of utterances. As a result of the recognition process, an appropri ate standard speaker is selected for speaker adaptation. This new meth od is experimentally compared with a conventional speaker-independent speech recognition method. Since the speaker-consistency principle bes t demonstrates its effect with a large number of training and test spe akers, a small-scale experiment may not fully exploit this principle. Nevertheless, even the results of our small-scale experiment show that the new method significantly outperforms the conventional method. In addition, this framework's speaker selection mechanism can drastically reduce the likelihood map computation.