Making aggregate views self-maintainable

Citation
M. Mohania et Y. Kambayashi, Making aggregate views self-maintainable, DATA KN ENG, 32(1), 2000, pp. 87-109
Citations number
33
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
DATA & KNOWLEDGE ENGINEERING
ISSN journal
0169023X → ACNP
Volume
32
Issue
1
Year of publication
2000
Pages
87 - 109
Database
ISI
SICI code
0169-023X(200001)32:1<87:MAVS>2.0.ZU;2-6
Abstract
Data in the warehouse can be seen as materialized views generated from the underlying multiple data sources. Materialized views are used to speedup qu ery processing on large amounts of data. These views need to be maintained in response to updates in the source data. This is often done using increme ntal techniques that access data from underlying sources. In the data wareh ousing scenario, accessing base relations can be difficult, sometimes data sources may be unavailable, since these relations are distributed across di fferent sources. For these reasons, the issue of self-maintainability of th e view is an important issue in data warehousing. In this paper we show tha t the warehouse views can be made self-maintainable if some additional rela tions, called auxiliary relations, derived from the intermediate results of the view computation can be materialized in the warehouse. We give an algo rithm for determining what auxiliary relations need to be materialized in o rder to make a materialized view self-maintainable. We then propose an effi cient self-maintainable incremental algorithm that computes the updates to both the materialized view and the additional relations. The primary object ive is to minimize the overall maintenance cost of a given view and the aux iliary relations. One important feature of our algorithm is that it derives the 'exact change' to every materialized additional relation, including th e materialized view, without accessing the view itself. This feature is imp ortant to ensure the correctness of the update to views defined by aggregat e functions, and also important in active database applications where trigg ers are fired by updates to the view. Finally, we compare the maintenance c ost of our incremental algorithm to that of counting algorithm and recomput ing the view from scratch (naive algorithm). (C) 2000 Elsevier Science B.V. All rights reserved.