SNOWBALL - SCALABLE STORAGE ON NETWORKS OF WORKSTATIONS WITH BALANCEDLOAD

Citation
R. Vingralek et al., SNOWBALL - SCALABLE STORAGE ON NETWORKS OF WORKSTATIONS WITH BALANCEDLOAD, DISTRIBUTED AND PARALLEL DATABASES, 6(2), 1998, pp. 117-156
Citations number
53
Categorie Soggetti
Computer Science Theory & Methods","Computer Science Information Systems","Computer Science Theory & Methods","Computer Science Information Systems
ISSN journal
09268782
Volume
6
Issue
2
Year of publication
1998
Pages
117 - 156
Database
ISI
SICI code
0926-8782(1998)6:2<117:S-SSON>2.0.ZU;2-3
Abstract
Networks of workstations are an emerging architectural paradigm for hi gh-performance parallel and distributed systems. Exploiting networks o f workstations for massive data management poses exciting challenges. We consider here the problem of managing record-structured data in suc h an environment. For example, managing collections of HTML documents on a cluster of WWW servers is an important application for which our approach provides support. The records are accessed by a dynamically g rowing set of clients based on a search key (e.g.,a URL). To scale up the throughput of client accesses with approximately constant response time, the records and thus also their access load are dynamically red istributed across a growing set of workstations. The paper addresses t wo problems of realistic workloads: skewed access frequencies to the r ecords and evolving access patterns where previously cold records may become hot and vice versa. Our solution incorporates load tracking at different levels of granularity and automatically chooses the appropri ate granularity for dynamic data migrations. Experimental results base d on a detailed simulation model show that our method is indeed succes sful in providing scalable cost/performance and explicitly controlling its level.