On performance potentials and conditional Monte Carlo for gradient estimation for Markov chains

Citation
Xr. Cao et al., On performance potentials and conditional Monte Carlo for gradient estimation for Markov chains, ANN OPER R, 87, 1999, pp. 263-272
Citations number
15
Categorie Soggetti
Engineering Mathematics
Journal title
ANNALS OF OPERATIONS RESEARCH
ISSN journal
02545330 → ACNP
Volume
87
Year of publication
1999
Pages
263 - 272
Database
ISI
SICI code
0254-5330(1999)87:<263:OPPACM>2.0.ZU;2-S
Abstract
We consider the problem of sample path-based gradient estimation for long-r un (steady-state) performance measures defined on discrete-time Markov chai ns. We show how two estimators - one derived using the likelihood ratio met hod with conditional Monte Carlo and splitting, and the other derived using performance potentials and perturbation analysis - are related. In particu lar, one can be expressed as the conditional expectation of a suitably weig hted average of the other. This demonstrates yet another connection between the two gradient estimation techniques of perturbation analysis and the li kelihood ratio method.