EFFICIENT BULK-LOADING OF GRIDFILES

Citation
S.T. Leutenegger and D.M. Nicol, EFFICIENT BULK-LOADING OF GRIDFILES, IEEE Transactions on Knowledge and Data Engineering, 9(3), 1997, pp. 410-420
Number of citations
8
Subject Categories
Information Science & Library Science; Computer Sciences, Special Topics; Engineering, Electrical & Electronic; Computer Science, Artificial Intelligence; Computer Science, Information Systems
Journal ISSN
1041-4347
Volume
9
Issue
3
Year of publication
1997
Pages
410 - 420
Database
ISI
SICI code
1041-4347(1997)9:3<410:EBOG>2.0.ZU;2-E
Abstract
This paper considers the problem of bulk-loading large data sets for the gridfile multiattribute indexing technique. We propose a rectilinear partitioning algorithm that heuristically seeks to minimize the size of the gridfile needed to ensure no bucket overflows. Empirical studies on both synthetic data sets and on data sets drawn from computational fluid dynamics applications demonstrate that our algorithm is very efficient, and is able to handle large data sets. In addition, we present an algorithm for bulk-loading data sets too large to fit in main memory. Utilizing a sort of the entire data set, it creates a gridfile without incurring any overflows.
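
Illustrative sketch
The abstract does not spell out the partitioning heuristic, so the following Python sketch is not the authors' algorithm. It only illustrates the problem statement under simple assumptions: two-dimensional points, one bucket per grid cell, and a naive search that grows a k x k grid of equi-depth cuts until no cell holds more than the bucket capacity. All function and parameter names are hypothetical.

```python
from bisect import bisect_right
from collections import Counter


def equi_depth_cuts(values, parts):
    """Return parts-1 cut points splitting the sorted values into roughly equal runs."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // parts] for i in range(1, parts)]


def max_cell_load(points, x_cuts, y_cuts):
    """Assign each point to its grid cell and return the largest cell population."""
    loads = Counter(
        (bisect_right(x_cuts, x), bisect_right(y_cuts, y)) for x, y in points
    )
    return max(loads.values()) if loads else 0


def smallest_overflow_free_grid(points, capacity):
    """Grow a k x k equi-depth grid until no cell exceeds the bucket capacity.

    Assumption: one bucket per cell. Raises ValueError when identical points
    alone exceed the capacity, since no rectilinear cut can separate them.
    """
    if capacity < 1:
        raise ValueError("capacity must be at least 1")
    if not points:
        return [], []
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    for k in range(1, len(points) + 1):
        x_cuts = equi_depth_cuts(xs, k)
        y_cuts = equi_depth_cuts(ys, k)
        if max_cell_load(points, x_cuts, y_cuts) <= capacity:
            return x_cuts, y_cuts
    raise ValueError("duplicate points exceed the bucket capacity")
```

A call such as smallest_overflow_free_grid(points, capacity) returns the cut positions on each axis. The brute-force search over k is chosen for clarity only; a heuristic of the kind the abstract describes would aim to reach a small grid far more cheaply.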