An iterative algorithm for extending learners to a semi-supervised setting

Citation
Culp, Mark et Michailidis, George, An iterative algorithm for extending learners to a semi-supervised setting, Journal of computational and graphical statistics , 17(3), 2008, pp. 545-571
ISSN journal
10618600
Volume
17
Issue
3
Year of publication
2008
Pages
545 - 571
Database
ACNP
SICI code
Abstract
In this article, we present an iterative self-training algorithm whose objective is to extend learners from a supervised setting into a semi-supervised setting. The algorithm is based on using the predicted values for observations where the response is missing (unlabeled data) and then incorporating the predictions appropriately at subsequent stages. Convergence properties of the algorithm are investigated for particular learners, such as linear/logistic regression and linear smoothers with particular emphasis on kernel smoothers. Further, implementation issues of the algorithm with other learners such as generalized additive models, tree partitioning methods, partial least squares, etc. are also addressed. The connection between the proposed algorithm and graph-based semi-supervised learning methods is also discussed. The algorithm is illustrated on a number of real datasets using a varying degree of labeled responses.