CLUSTERING FACTOR ESTIMATION FOR TOTALLY CLUSTERED ATTRIBUTES

Citation
F. Grandi et Mr. Scalas, CLUSTERING FACTOR ESTIMATION FOR TOTALLY CLUSTERED ATTRIBUTES, Data & knowledge engineering, 14(3), 1995, pp. 251-264
Citations number
16
Categorie Soggetti
Computer Science Artificial Intelligence","Computer Science Information Systems
ISSN journal
0169023X
Volume
14
Issue
3
Year of publication
1995
Pages
251 - 264
Database
ISI
SICI code
0169-023X(1995)14:3<251:CFEFTC>2.0.ZU;2-K
Abstract
Cost models based on the clustering factor (CF) of the attributes have been proposed and shown to be attractive for block access estimation in databases, thanks to their accuracy and economy of use. While query optimizers can use the actual CFs, measured from the data, physical d esign methods and tools must rely on estimates before the data are sto red. In this paper we present a CF estimation procedure which can be a pplied to totally clustered attributes (e.g. ordered attributes). Simp le and accurate approximations of the derived formulas are also introd uced. Simulations show the accuracy of the proposed CF estimates and t he improvement in their behaviour compared to previously published est imates. Reliability for physical design of cost models based on the CF in the presence of a skewed data distribution is also discussed.