THE STRONG LAW OF LARGE NUMBERS FOR SEQUENTIAL DECISIONS UNDER UNCERTAINTY

Authors
Citation
Ph. Algoet, THE STRONG LAW OF LARGE NUMBERS FOR SEQUENTIAL DECISIONS UNDER UNCERTAINTY, IEEE transactions on information theory, 40(3), 1994, pp. 609-633
Citations number
109
Categorie Soggetti
Information Science & Library Science","Engineering, Eletrical & Electronic
ISSN journal
00189448
Volume
40
Issue
3
Year of publication
1994
Pages
609 - 633
Database
ISI
SICI code
0018-9448(1994)40:3<609:TSLOLN>2.0.ZU;2-L
Abstract
We combine optimization and ergodic theory to characterize the optimum long-run average performance that can be asymptotically attained by n onanticipating sequential decisions. Let {X(t)} be a stationary ergodi c process, and suppose an action b(t) must be selected in a space B wi th knowledge of the t-past (X0,...,X(t-1)) at the beginning of every p eriod t greater-than-or-equal-to 0. Action b(t) will incur a loss l(b( t), X(t)) at the end of period t when the random variable X(t) is reve aled. We prove under mild integrability conditions that the optimum st rategy is to select actions that minimize the conditional expected los s given the currently available information at each step. The minimum long-run average loss per decision can be approached arbitrarily close ly by strategies that are finite-order Markov, and under certain conti nuity conditions, it is equal to the minimum expected loss given the i nfinite past. If the loss l(b, x) is bounded and continuous and if the space B is compact, then the minimum can be asymptotically attained, even if the distribution of the process {X(t)} is unknown a priori and must be learned from experience.