Semantic integration of heterogeneous information sources

Citation
S. Bergamaschi et al., Semantic integration of heterogeneous information sources, DATA KN ENG, 36(3), 2001, pp. 215-249
Citations number
37
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
DATA & KNOWLEDGE ENGINEERING
ISSN journal
0169023X → ACNP
Volume
36
Issue
3
Year of publication
2001
Pages
215 - 249
Database
ISI
SICI code
0169-023X(200103)36:3<215:SIOHIS>2.0.ZU;2-H
Abstract
Developing intelligent tools for the integration of information extracted f rom multiple heterogeneous sources is a challenging issue to effectively ex ploit the numerous sources available on-line in global information systems. In this paper, we propose intelligent, tool-supported techniques to inform ation extraction and integration from both structured and semistructured da ta sources. An object-oriented language, with an underlying Description Log ic, called ODLI3, derived from the standard ODMG is introduced for informat ion extraction. ODLI3 descriptions of the source schemas are exploited firs t to set a Common Thesaurus for the sources. Information integration is the n performed in a semiautomatic way by exploiting the knowledge in the Commo n Thesaurus and ODLI3 descriptions of source schemas with a combination of clustering techniques and Description Logics. This integration process give s rise to a virtual integrated view of the underlying sources for which map ping rules and integrity constraints are specified to handle heterogeneity. Integration techniques described in the paper are provided in the framewor k of the MOMIS system based on a conventional wrapper/mediator architecture . (C) 2001 Elsevier Science B.V. All rights reserved.