ITA
ENG

ZERO-SUM MARKOV GAMES AND WORST-CASE OPTIMAL-CONTROL OF QUEUING-SYSTEMS

Authors

ALTMAN E HORDIJK A

Citation

E. Altman et A. Hordijk, ZERO-SUM MARKOV GAMES AND WORST-CASE OPTIMAL-CONTROL OF QUEUING-SYSTEMS, Queuing systems, 21(3-4), 1995, pp. 415-447

Citations number

Categorie Soggetti

Operatione Research & Management Science","Computer Science Interdisciplinary Applications

Journal title

Queuing systems → ACNP

ISSN journal

02570130

Volume

Issue

3-4

Year of publication

1995

Pages

415 - 447

Database

ISI

SICI code

0257-0130(1995)21:3-4<415:ZMGAWO>2.0.ZU;2-U

Abstract

Zero-sum stochastic games model situations where two persons, called p layers, control some dynamic system, and both have opposite objectives . One player wishes typically to minimize a cost which has to be paid to the other player. Such a game may also be used to model problems wi th a single controller who has only partial information on the system: the dynamic of the system may depend on some parameter that is unknow n to the controller, and may vary in time in an unpredictable way. A w orst-case criterion may be considered, where the unknown parameter is assumed to be chosen by ''nature'' (called player 1), and the objectiv e of the controller (player 2) is then to design a policy that guarant ees the best performance under worst-case behaviour of nature. The pur pose of this paper is to present a survey of stochastic games in queue s, where both tools and applications are considered. The first part is devoted to the tools. We present some existing tools for solving fini te horizon and infinite horizon discounted Markov games with unbounded cost, and develop new ones that are typically applicable in queueing problems. We then present some new tools and theory of expected averag e cost stochastic games with unbounded cost. In the second part of the paper we present a survey on existing results on worst-case control o f queues, and illustrate the structural properties of best policies of the controller, worst-case policies of nature, and of the value funct ion. Using the theory developed in the first part of the paper, we ext end some of the above results, which were known to hold for finite hor izon costs or for the discounted cost, to the expected average cost.