TERM DEPENDENCE - TRUNCATING THE BAHADUR LAZARSFELD EXPANSION

Authors
Citation
Rm. Losee, TERM DEPENDENCE - TRUNCATING THE BAHADUR LAZARSFELD EXPANSION, Information processing & management, 30(2), 1994, pp. 293-303
Citations number
19
Categorie Soggetti
Information Science & Library Science","Information Science & Library Science","Computer Science Information Systems
ISSN journal
03064573
Volume
30
Issue
2
Year of publication
1994
Pages
293 - 303
Database
ISI
SICI code
0306-4573(1994)30:2<293:TD-TTB>2.0.ZU;2-M
Abstract
The performance of probabilistic information retrieval systems is stud ied where differing statistical dependence assumptions are used when e stimating the probabilities inherent in the retrieval model. Experimen tal results using the Bahadur Lazarsfeld expansion suggest that the gr eatest degree of performance increase is achieved by incorporating ter m dependence information in estimating Pr (d\rel). It is suggested tha t incorporating dependence in Pr (d\rel) to degree 3 be used; incorpor ating more dependence information results in relatively little increas e in performance. Experiments examine the span of dependence in natura l language text, the window of terms in which dependencies are compute d, and their effect on information retrieval performance. Results prov ide additional support for the notion of a window of +/- 3 to +/- 5 te rms in width; terms in this window may be most useful when computing d ependence.