Information filtering: Overview of issues, research and systems

Citation
U. Hanani et al., Information filtering: Overview of issues, research and systems, USER MOD US, 11(3), 2001, pp. 203-259
Citations number
140
Categorie Soggetti
Computer Science & Engineering
Journal title
USER MODELING AND USER-ADAPTED INTERACTION
ISSN journal
09241868 → ACNP
Volume
11
Issue
3
Year of publication
2001
Pages
203 - 259
Database
ISI
SICI code
0924-1868(200108)11:3<203:IFOOIR>2.0.ZU;2-L
Abstract
An abundant amount of information is created and delivered over electronic media. Users risk becoming overwhelmed by the flow of information, and they lack adequate tools to help them manage the situation. Information filteri ng (IF) is one of the methods that is rapidly evolving to manage large info rmation flows. The aim of IF is to expose users to only information that is relevant to them. Many IF systems have been developed in recent years for various application domains. Some examples of filtering applications are: f ilters for search results on the internet that are employed in the Internet software, personal e-mail filters based on personal profiles, listservers or newsgroups filters for groups or individuals, browser filters that block non-valuable information, filters designed to give children access them on ly to suitable pages, filters for e-commerce applications that address prod ucts and promotions to potential customers only, and many more. The differe nt systems use various methods, concepts, and techniques from diverse resea rch areas like: Information Retrieval, Artificial Intelligence, or Behavior al Science. Various systems cover different scope, have divergent functiona lity, and various platforms. There are many systems of widely varying philo sophies, but all share the goal of automatically directing the most valuabl e information to users in accordance with their User Model, and of helping them use their limited reading time most optimally. This paper clarifies the difference between IF systems and related systems, such as information retrieval (IR) systems, or Extraction systems. The pap er defines a framework to classify IF systems according to several paramete rs, and illustrates the approach with commercial and academic systems. The paper describes the underlying concepts of IF systems and the techniques th at are used to implement them. It discusses methods and measurements that a re used for evaluation of IF systems and limitations of the current systems . In the conclusion we present research issues in the Information Filtering research arena, such as user modeling, evaluation standardization and inte gration with digital libraries and Web repositories.