ALLOCATING DATA AND OPERATIONS TO NODES IN DISTRIBUTED DATABASE DESIGN

Authors
Citation
St. March et S. Rho, ALLOCATING DATA AND OPERATIONS TO NODES IN DISTRIBUTED DATABASE DESIGN, IEEE transactions on knowledge and data engineering, 7(2), 1995, pp. 305-317
Citations number
30
Categorie Soggetti
Information Science & Library Science","Computer Sciences, Special Topics","Engineering, Eletrical & Electronic","Computer Science Artificial Intelligence
ISSN journal
10414347
Volume
7
Issue
2
Year of publication
1995
Pages
305 - 317
Database
ISI
SICI code
1041-4347(1995)7:2<305:ADAOTN>2.0.ZU;2-J
Abstract
The allocation of data and operations to nodes in a computer communica tions network is a critical issue in distributed database design, An e fficient distributed database design must trade off performance and co st among retrieval and update activities at the various nodes, It must consider the concurrency control mechanism used as well as capacity c onstraints at nodes and on links in the network, It must determine whe re data will be allocated, the degree of data replication, which copy of the data will be used for each retrieval activity, and where operat ions such as select, project, join, and union will be performed, We de velop a comprehensive mathematical modeling approach for this problem, The approach first generates units of data (file fragments) to be all ocated from a logical data model representation and a characterization of retrieval and update activities, Retrieval and up date activities are then decomposed into relational operations on these fragments. Bot h fragments and operations on them are then allocated to nodes using a mathematical modeling approach, The mathematical model considers netw ork communication, local processing, and data storage costs, A genetic algorithm is developed to solve this mathematical formulation.