W. Lam et Kf. Low, Using discretization and Bayesian inference network learning for automaticfiltering profile generation, IEEE SYST C, 30(3), 2000, pp. 340-351
Citations number
30
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS
We develop a new approach for text document filtering based on automatic co
nstruction of filtering profiles using Bayesian inference network learning.
Bayesian inference networks, based on probability theory, offer a suitable
framework to harness the uncertainty found in the nature of the filtering
problem. In order to learn the networks effectively, we explore three diffe
rent techniques for discretization, Good features of high predictive power
are automatically obtained from the training document content. Our approach
does not need to know in advance the subject or content of documents as we
ll as the information needs expressed as topics. A series of experiments on
a set of topics were conducted on two large-scale real-world document corp
ora, The empirical results demonstrate that our Bayesian inference network
learning with advanced discretization achieves better performance over the
simple Naive Bayesian approach.