DETERMINING INFORMATION-RETRIEVAL AND FILTERING PERFORMANCE WITHOUT EXPERIMENTATION

Authors
R. M. Losee
Citation
R. M. Losee, DETERMINING INFORMATION-RETRIEVAL AND FILTERING PERFORMANCE WITHOUT EXPERIMENTATION, Information Processing & Management, 31(4), 1995, pp. 555-572
Number of citations
44
Subject Categories
Information Science & Library Science; Computer Science, Information Systems
ISSN journal
03064573
Volume
31
Issue
4
Year of publication
1995
Pages
555 - 572
Database
ISI
SICI code
0306-4573(1995)31:4<555:DIAFPW>2.0.ZU;2-X
Abstract
The performance of an information retrieval or text and media filtering system may be determined through analytic methods as well as by traditional simulation or experimental methods. These analytic methods can provide precise statements about expected performance. They can thus determine which of two similarly performing systems is superior. For both a single query term and for a multiple query term retrieval model, a method for comparing the performance of different probabilistic retrieval methods is developed. This method may be used in computing the average search length for a query, given only knowledge of database parameter values. Predictive models for inverse document frequency, binary independence, and relevance feedback based retrieval and filtering are described. Simulations illustrate how the single term model performs and sample performance predictions are given for single term and multiple term problems.
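
The idea of predicting average search length from database parameters alone can be illustrated with a small sketch. It is not the paper's own formulation, which this record does not reproduce. It assumes a single binary query term, a ranking that places every document containing the term ahead of every document lacking it, and random ordering within each of the two groups. Average search length is taken here to mean the expected rank position of a relevant document. The function name and parameter names are hypothetical.

```python
def expected_asl(n_docs, n_with_term, rel_total, rel_with_term):
    """Expected rank of a relevant document for a one-term ranking.

    Assumptions (illustrative, not taken from the paper):
      - documents containing the term are ranked first, in random order;
      - documents lacking the term follow, also in random order.
    """
    n_without = n_docs - n_with_term
    rel_without = rel_total - rel_with_term
    if rel_total <= 0:
        raise ValueError("need at least one relevant document")
    if not (0 <= rel_with_term <= n_with_term and 0 <= rel_without <= n_without):
        raise ValueError("inconsistent counts")

    # Mean position inside the leading block (ranks 1 .. n_with_term).
    pos_with = (n_with_term + 1) / 2
    # Mean position inside the trailing block (ranks n_with_term+1 .. n_docs).
    pos_without = n_with_term + (n_without + 1) / 2

    # Weight each block's mean position by its share of relevant documents.
    return (rel_with_term * pos_with + rel_without * pos_without) / rel_total


# 10 documents, 4 contain the term, 3 are relevant, 2 of those contain the term.
asl = expected_asl(n_docs=10, n_with_term=4, rel_total=3, rel_with_term=2)
print(round(asl, 4))
```

Under these assumptions, no retrieval run is needed: the four counts alone determine the expected position, so two term-weighting choices can be compared by comparing the values they yield.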