Using a language independent domain model for multilingual information extraction

Citation
S. Azzam et al., Using a language independent domain model for multilingual information extraction, APPL ARTIF, 13(7), 1999, pp. 705-724
Number of citations
17
Subject Categories
AI Robotics and Automatic Control
Journal title
APPLIED ARTIFICIAL INTELLIGENCE
ISSN journal
0883-9514
Volume
13
Issue
7
Year of publication
1999
Pages
705 - 724
Database
ISI
SICI code
0883-9514(199910/11)13:7<705:UALIDM>2.0.ZU;2-Z
Abstract
The volume of electronic text in different languages, particularly on the World Wide Web, is growing significantly, and the problem of users who are restricted in the number of languages they read obtaining information from this text is becoming more widespread. This article investigates some of the issues involved in achieving multilingual information extraction (IE), describes the approach adopted in the M-LaSIE-II IE system, which addresses these problems, and presents the results of evaluating the approach against a small parallel corpus of English/French newswire texts. The approach is based on the assumption that it is possible to construct a language independent representation of concepts relevant to the domain, at least for the small well-defined domains typical of IE tasks, allowing multilingual IE to be successfully carried out without requiring full machine translation.