Bi-modal sentence structure for language modeling

Citation
Kw. Ma et al., Bi-modal sentence structure for language modeling, SPEECH COMM, 31(1), 2000, pp. 51-67
Citations number
21
Categorie Soggetti
Computer Science & Engineering
Journal title
SPEECH COMMUNICATION
ISSN journal
01676393 → ACNP
Volume
31
Issue
1
Year of publication
2000
Pages
51 - 67
Database
ISI
SICI code
0167-6393(200005)31:1<51:BSSFLM>2.0.ZU;2-9
Abstract
According to discourse theories in linguistics, conversational utterances p ossess an informational structure. That is, each sentence consists of two c omponents: the given and the new. The given refers to information that has previously been conveyed in the conversation such as that in That's interes ting. The new section of a sentence introduces additional information that is new to the conversation such as the word interesting in the previous exa mple. In this work, we take advantage of this inherent structure for the pu rpose of automatic conversational speech recognition by building sub-senten ce discourse language models (LMs) to represent the bi-modal nature of each conversational sentence. The internal sentence structure is captured with a statistical sentence model regardless of whether the input sentences are linguistically or acoustically segmented. The proposed model is verified on the Switchboard corpus. The resulting model contributes to a reduction in both LM perplexity and word recognition error rate. (C) 2000 Elsevier Scien ce B.V. All rights reserved.