A CLASSIFICATION APPROACH TO BOOLEAN QUERY REFORMULATION

Citation
Jc. French et al., A CLASSIFICATION APPROACH TO BOOLEAN QUERY REFORMULATION, Journal of the American Society for Information Science, 48(8), 1997, pp. 694-706
Number of citations
22
Subject Categories
Information Science & Library Science; Computer Science, Information Systems
ISSN journal
0002-8231
Volume
48
Issue
8
Year of publication
1997
Pages
694 - 706
Database
ISI
SICI code
0002-8231(1997)48:8<694:ACATBQ>2.0.ZU;2-1
Abstract
One of the difficulties in using current Boolean-based information retrieval systems is that it is hard for a user, especially a novice, to formulate an effective Boolean query. Query reformulation can be even more difficult and complex than formulation since users often have difficulty incorporating the new information gained from the previous search into the next query. In this article, query reformulation is viewed as a classification problem, that is, classifying documents as either relevant or nonrelevant. A new reformulation algorithm is proposed which builds a tree-structured classifier, called a query tree, at each reformulation from a set of feedback documents retrieved from the previous search. The query tree can easily be transformed into a Boolean query. The query tree is compared to two query reformulation algorithms on benchmark test sets (CACM, CISI, and Medlars). In most experiments, the query tree showed significant improvements in precision over the two algorithms compared in this study. We attribute this improved performance to the ability of the query tree algorithm to select good search terms and to represent the relationships among search terms in a tree structure.
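The general idea in the abstract, a tree-structured classifier built from judged feedback documents and then read off as a Boolean query, can be illustrated with a minimal sketch. This is not the authors' query tree algorithm: the abstract does not specify the splitting criterion or stopping rule, so the Gini-based split, the depth limit, and the names `gini`, `build_tree`, and `to_boolean_query` are all assumptions made for illustration, as are the toy documents.

```python
from fractions import Fraction


def gini(labels):
    """Gini impurity of a list of 0/1 labels (exact arithmetic, so ties are stable)."""
    n = len(labels)
    if n == 0:
        return Fraction(0)
    pos = sum(labels)
    return Fraction(2 * pos * (n - pos), n * n)


def build_tree(docs, labels, terms, max_depth=3):
    """Build a term-presence decision tree (hypothetical stand-in for a query tree).

    docs: list of term sets; labels: 1 = relevant, 0 = nonrelevant;
    terms: set of candidate split terms.
    """
    leaf = {"relevant": 2 * sum(labels) > len(labels)}  # majority vote
    # Stop on a pure node, when no terms remain, or at the depth limit.
    if len(set(labels)) <= 1 or not terms or max_depth == 0:
        return leaf

    def split(term):
        present = [i for i, d in enumerate(docs) if term in d]
        absent = [i for i, d in enumerate(docs) if term not in d]
        return present, absent

    def cost(term):
        # Size-weighted impurity of the two sides after splitting on `term`.
        sides = [[labels[i] for i in side] for side in split(term)]
        return sum(len(s) * gini(s) for s in sides) / len(labels)

    best = min(sorted(terms), key=cost)  # sorted() makes tie-breaking deterministic
    if cost(best) >= gini(labels):       # no term reduces impurity
        return leaf
    present, absent = split(best)
    rest = terms - {best}
    return {
        "term": best,
        "present": build_tree([docs[i] for i in present],
                              [labels[i] for i in present], rest, max_depth - 1),
        "absent": build_tree([docs[i] for i in absent],
                             [labels[i] for i in absent], rest, max_depth - 1),
    }


def to_boolean_query(tree, path=()):
    """Return one AND-clause per root-to-leaf path that ends in a relevant leaf."""
    if "relevant" in tree:
        return [" AND ".join(path)] if tree["relevant"] and path else []
    term = tree["term"]
    return (to_boolean_query(tree["present"], path + (term,))
            + to_boolean_query(tree["absent"], path + (f"NOT {term}",)))


# Toy feedback set (invented for illustration): term sets with relevance judgments.
docs = [
    {"query", "boolean"}, {"query", "tree"}, {"query", "feedback"},
    {"query", "sql"}, {"tree", "forest"}, {"garden", "soil"},
]
labels = [1, 1, 1, 0, 0, 0]

tree = build_tree(docs, labels, set().union(*docs))
query = " OR ".join(f"({clause})" for clause in to_boolean_query(tree))
print(query)  # (query AND NOT sql)
```

In this sketch each path from the root to a relevant leaf becomes a conjunction of present and negated terms, and the paths are joined with OR, which is one straightforward way a tree could be turned into a Boolean query.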