Using a language independent domain model for multilingual information extraction

Authors

Azzam, S; Humphreys, K; Gaizauskas, R; Wilks, Y

Citation

S. Azzam et al., Using a language independent domain model for multilingual information extraction, APPL ARTIF, 13(7), 1999, pp. 705-724

Subject Categories

AI Robotics and Automatic Control

Journal title

APPLIED ARTIFICIAL INTELLIGENCE

ISSN journal

0883-9514

Volume

13

Issue

7

Year of publication

1999

Pages

705 - 724

Database

ISI

SICI code

0883-9514(199910/11)13:7<705:UALIDM>2.0.ZU;2-Z

Abstract

The volume of electronic text in different languages, particularly on the World Wide Web, is growing significantly, and the problem of users who are restricted in the number of languages they read obtaining information from this text is becoming more widespread. This article investigates some of the issues involved in achieving multilingual information extraction (IE), describes the approach adopted in the M-LaSIE-II IE system, which addresses these problems, and presents the results of evaluating the approach against a small parallel corpus of English/French newswire texts. The approach is based on the assumption that it is possible to construct a language independent representation of concepts relevant to the domain, at least for the small well-defined domains typical of IE tasks, allowing multilingual IE to be successfully carried out without requiring full machine translation.