Incorporating linguistic structure into statistical language models

Authors
Citation
R. Rosenfeld, Incorporating linguistic structure into statistical language models, PHI T ROY A, 358(1769), 2000, pp. 1311-1324
Number of citations
33
Subject Categories
Multidisciplinary
Journal title
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES
ISSN journal
1364503X
Volume
358
Issue
1769
Year of publication
2000
Pages
1311 - 1324
Database
ISI
SICI code
1364-503X(20000415)358:1769<1311:ILSISL>2.0.ZU;2-B
Abstract
Statistical language models estimate the distribution of natural language for the purpose of improving various language technology applications. Ironically, the most successful models of this type take little advantage of the nature of language. I review the extent to which various aspects of natural language are captured in current models. I then describe a general framework, recently developed at our laboratory, for incorporating arbitrary linguistic structure into a statistical framework, and present a methodology for eliciting linguistic features currently missing from the model. Finally, I ponder our failure heretofore to integrate linguistic theories into a statistical framework, and suggest possible reasons for it.