Vocabulary mining for information retrieval: rough sets and fuzzy sets

Citation
P. Srinivasan et al., Vocabulary mining for information retrieval: rough sets and fuzzy sets, INF PR MAN, 37(1), 2001, pp. 15-38
Citations number
29
Categorie Soggetti
Library & Information Science","Information Tecnology & Communication Systems
Journal title
INFORMATION PROCESSING & MANAGEMENT
ISSN journal
03064573 → ACNP
Volume
37
Issue
1
Year of publication
2001
Pages
15 - 38
Database
ISI
SICI code
0306-4573(200101)37:1<15:VMFIRR>2.0.ZU;2-Z
Abstract
Vocabulary mining in information retrieval refers to the utilization of the domain vocabulary towards improving the user's query. Most often queries p osed to information retrieval systems are not optimal for retrieval purpose s. Vocabulary mining allows one to generalize, specialize or perform other kinds of vocabulary-based transformations on the query in order to improve retrieval performance. This paper investigates a new framework for vocabula ry mining that derives from the combination of rough sets and fuzzy sets. T he framework allows one to use rough set-based approximations even when the documents and queries are described using weighted, i.e,, fuzzy representa tions. The paper also explores the application of generalized rough sets an d the variable precision models. The problem of coordination between multip le vocabulary views is also examined. Finally, a preliminary analysis of is sues that arise when applying the proposed vocabulary mining framework to t he Unified Medical Language System (a state-of-the-art vocabulary system) i s presented. The proposed framework supports the systematic study and appli cation of different vocabulary views in information retrieval. (C) 2000 Els evier Science Ltd. All rights reserved.