The paper combines a comprehensive account of a probabilistic model of retr
ieval with new systematic experiments on TREC Programme material. It presen
ts the model from its foundations through its logical development to cover
more aspects of retrieval data and a wider range of system functions. Each
step in the argument is matched by comparative retrieval tests, to provide
a single coherent account of a major line of research. The experiments demo
nstrate, for a large test collection, that the probabilistic model is effec
tive and robust, and that it responds appropriately, with major improvement
s in performance, to key features of retrieval situations.
Part 1 covers the foundations and the model development for document collec
tion and relevance data, along with the test apparatus. Part 2 covers the f
urther development and elaboration of the model, with extensive testing, an
d briefly considers other environment conditions and tasks, model training,
concluding with comparisons with other approaches and an overall assessmen
t.
Data and results tables for both parts ave given in Part 1. Key results are
summarised in Part 2. (C) 2000 Elsevier Science Ltd. All rights reserved.