An architecture for data warehousing supporting data independence and interoperability

Citation
L. Cabibbo et R. Torlone, An architecture for data warehousing supporting data independence and interoperability, INT J COOP, 10(3), 2001, pp. 377-397
Citations number
31
Categorie Soggetti
Information Tecnology & Communication Systems
Journal title
INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS
ISSN journal
02188430 → ACNP
Volume
10
Issue
3
Year of publication
2001
Pages
377 - 397
Database
ISI
SICI code
0218-8430(200109)10:3<377:AAFDWS>2.0.ZU;2-E
Abstract
We report on the design of a novel architecture for data warehousing based on the introduction of an explicit "logical" layer to the traditional data warehousing framework. This layer serves to guarantee a complete independen ce of OLAP applications from the physical storage structure of the data war ehouse and thus allows users and applications to manipulate multidimensiona l data ignoring implementation details. For example, it makes possible the modification of the data warehouse organization (e.g. MOLAP or ROLAP implem entation, star scheme or snowflake scheme structure) without influencing th e high level description of multidimensional data and programs that use the data. Also, it supports the integration of multidimensional data stored in heterogeneous OLAP servers. We propose MD, a simple data model for multidi mensional databases, as the reference for the logical layer. MD provides an abstract formalism to describe the basic concepts that can be found in any OLAP system (fact, dimension, level of aggregation, and measure). We show that MD databases can be Implemented in both relational and multidimensiona l storage systems. We also show that MD can be profitably used in OLAP appl ications as front-end. We finally describe the design of a practical system that supports the above logical architecture; this system is used to show in practice how the architecture we propose can hide implementation details and provides a support for interoperability between different and possibly heterogeneous data warehouse applications.