A PROBABILISTIC RELATIONAL ALGEBRA FOR THE INTEGRATION OF INFORMATION-RETRIEVAL AND DATABASE-SYSTEMS

Authors
Citation
N. Fuhr et T. Rolleke, A PROBABILISTIC RELATIONAL ALGEBRA FOR THE INTEGRATION OF INFORMATION-RETRIEVAL AND DATABASE-SYSTEMS, ACM transactions on information systems, 15(1), 1997, pp. 32-66
Citations number
48
Categorie Soggetti
Information Science & Library Science","Computer Science Information Systems
ISSN journal
10468188
Volume
15
Issue
1
Year of publication
1997
Pages
32 - 66
Database
ISI
SICI code
1046-8188(1997)15:1<32:APRAFT>2.0.ZU;2-C
Abstract
We present a probabilistic relational algebra (PRA) which is a general ization of standard relational algebra. In PRA, tuples are assigned pr obabilistic weights giving the probability that a tuple belongs to a r elation. Based on intensional semantics, the tuple weights of the resu lt of a PRA expression always conform to the underlying probabilistic model. We also show for which expressions extensional semantics yields the same results. Furthermore, we discuss complexity issues and indic ate possibilities for optimization. With regard to databases, the appr oach allows for representing imprecise attribute values, whereas for i nformation retrieval, probabilistic document indexing and probabilisti c search term weighting can be modeled. We introduce the concept of va gue predicates which yield probabilistic weights instead of Boolean va lues, thus allowing for queries with vague selection conditions. With these features, PRA implements uncertainty and vagueness in combinatio n with the relational model.