A visualization case study of feature vector and stemmer effects on TREC topic-document subsets

Citation
Mt. Rorvig et al., A visualization case study of feature vector and stemmer effects on TREC topic-document subsets, P ASIS ANNU, 35, 1998, pp. 130-141
Citations number
45
Categorie Soggetti
Library & Information Science
Journal title
PROCEEDINGS OF THE ASIS ANNUAL MEETING
ISSN journal
00447870 → ACNP
Volume
35
Year of publication
1998
Pages
130 - 141
Database
ISI
SICI code
0044-7870(1998)35:<130:AVCSOF>2.0.ZU;2-1
Abstract
A method of visual analysis is demonstrated which takes advantage of the "p ooling" technique of topic-document set creation in the TREC collection. TR EC topic-document sets create a specific pattern when converted to similari ty measures, scaled, and plotted. Using the visual pattern created by full text as a normative view of the data, the effect of feature vectors and ste mming on recovering the normative view is shown visually. When stemmed, fea ture vectors of length 200 were shown to substantially recover the normativ e visual configuration created by full text. Some caution regarding the use of stemming is indicated by the dispersion of documents in the visual fiel d if feature vector approaches are to be applied to filtering tasks.