SUPRASEGMENTAL DURATION CONTROL WITH MATRIX PARSING IN CONTINUOUS SPEECH RECOGNITION

Citation
H. Singer et S. Sagayama, SUPRASEGMENTAL DURATION CONTROL WITH MATRIX PARSING IN CONTINUOUS SPEECH RECOGNITION, Speech communication, 13(3-4), 1993, pp. 315-322
Citations number
6
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
13
Issue
3-4
Year of publication
1993
Pages
315 - 322
Database
ISI
SICI code
0167-6393(1993)13:3-4<315:SDCWMP>2.0.ZU;2-F
Abstract
This paper describes a unified framework for continuous speech recogni tion (CSR) under grammatical constraints, where trellis calculations a nd parsing are performed by the same simple fundamental operations, na mely multiplication and addition of likelihood matrices. The matrix pa rser is shown to be a generalization of the CYK parser. It also facili tates explicit supra-segmental duration control for all grammatical ca tegories. Preliminary results showed that improved duration control on the mora level raised the recognition accuracy for a phrase recogniti on task from 86.7% to 88.5%.