A survey of smoothing techniques for ME models

Citation
S.F. Chen and R. Rosenfeld, A survey of smoothing techniques for ME models, IEEE SPEECH, 8(1), 2000, pp. 37-50
Number of citations
43
Subject Categories
Electrical & Electronics Engineering
Journal title
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Journal ISSN
1063-6676
Volume
8
Issue
1
Year of publication
2000
Pages
37 - 50
Database
ISI
SICI code
1063-6676(200001)8:1<37:ASOSTF>2.0.ZU;2-3
Abstract
In certain contexts, maximum entropy (ME) modeling can be viewed as maximum likelihood (ML) training for exponential models, and like other ML methods is prone to overfitting of training data. Several smoothing methods for ME models have been proposed to address this problem, but previous results do not make it clear how these smoothing methods compare with smoothing methods for other types of related models. In this work, we survey previous work in ME smoothing and compare the performance of several of these algorithms with conventional techniques for smoothing n-gram language models. Because of the mature body of research in n-gram model smoothing and the close connection between ME and conventional n-gram models, this domain is well suited to gauge the performance of ME smoothing methods. Over a large number of data sets, we find that fuzzy ME smoothing performs as well as or better than all other algorithms under consideration. We contrast this method with previous n-gram smoothing methods to explain its superior performance.