S. Igawa et al., SPEECH RECOGNITION BASED ON FUSION OF VISUAL AND AUDITORY INFORMATIONUSING FULL-FRAME COLOR IMAGE, IEICE transactions on fundamentals of electronics, communications and computer science, E79A(11), 1996, pp. 1836-1840
Citations number
3
Categorie Soggetti
Engineering, Eletrical & Electronic","Computer Science Hardware & Architecture","Computer Science Information Systems
We propose a method to fuse auditory information and visual informatio
n for accurate speech recognition. This method fuses two kinds of info
rmation by using linear combination after calculating two kinds of pro
babilities by HMM for each word. In addition, we use full-frame color
image as visual information in order to improve the accuracy of the pr
oposed speech recognition system. We have performed experiments compar
ing the proposed method with the method using either auditory informat
ion of visual information, and confirmed the validity of the proposed
method.