CONTINUOUS SPEECH DICTATION - FROM THEORY TO PRACTICE

Citation
V. Steinbiss et al., CONTINUOUS SPEECH DICTATION - FROM THEORY TO PRACTICE, Speech communication, 17(1-2), 1995, pp. 19-38
Citations number
33
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
17
Issue
1-2
Year of publication
1995
Pages
19 - 38
Database
ISI
SICI code
0167-6393(1995)17:1-2<19:CSD-FT>2.0.ZU;2-6
Abstract
This paper gives an overview of the Philips research system for phonem e-based, large-vocabulary, continuous-speech recognition. The system h as been successfully applied to various tasks in the German and (Ameri can) English languages, ranging from small vocabulary tasks to very la rge vocabulary tasks. Here, we concentrate on continuous-speech recogn ition for dictation in real applications, the dictation of legal repor ts and radiology reports in German. We describe this task and report o n experimental results. We also describe a commercial PC-based dictati on system which includes a PC implementation of our scientific recogni tion prototype. In order to allow for a comparison with the performanc e of other systems, a section with an evaluation on the standard Wall Street Journal task (dictation of American English newspaper text) is supplied. The recognition architecture is based on an integrated stati stical approach. We describe the characteristic features of the system as opposed to other systems: 1. the Viterbi criterion is consistently applied both in training and testing; 2. continuous mixture densities are used without tying or smoothing; 3. time-synchronous beam search in connection with a phoneme look-ahead is applied to a tree-organized lexicon.