COMPARING BOOLEAN AND PROBABILISTIC INFORMATION-RETRIEVAL SYSTEMS ACROSS QUERIES AND DISCIPLINES

Authors
Citation
Rm. Losee, COMPARING BOOLEAN AND PROBABILISTIC INFORMATION-RETRIEVAL SYSTEMS ACROSS QUERIES AND DISCIPLINES, Journal of the American Society for Information Science, 48(2), 1997, pp. 143-156
Number of citations
49
Subject Categories
Information Science & Library Science; Computer Science Information Systems
ISSN journal
00028231
Volume
48
Issue
2
Year of publication
1997
Pages
143 - 156
Database
ISI
SICI code
0002-8231(1997)48:2<143:CBAPIS>2.0.ZU;2-S
Abstract
Whether using Boolean queries or ranking documents using document and term weights will result in better retrieval performance has been the subject of considerable discussion among document retrieval system users and researchers. We suggest a method that allows one to analytically compare the two approaches to retrieval and examine their relative merits. The performance of information retrieval systems may be determined either by using experimental simulation, or through the application of analytic techniques that directly estimate the retrieval performance, given values for query and database characteristics. Using these performance-predicting techniques, sample performance figures are provided for queries using the Boolean AND and OR, as well as for probabilistic systems assuming statistical term independence or term dependence. The variation of performance across sublanguages (used in different academic disciplines) and queries is examined. The performance of models failing to meet statistical and other assumptions is examined.