To survey stochastic games (the notion of value, complexity classes, strategy implementation, etc) and to cover some recent advances with algorithmic flavor.
Problems in finite horizon, discounted, with stopping time, and with mean-payoff.
Bellman’s dynamic programming equation. Positional strategies versus history-dependent strategies.
The zero-sum two-player case: Shapley’s extension of Bellman equation.
Miscellaneous examples, some being unexpected: risk-sensitive problems, log-glasses transform positive linear dynamics (population dynamics) to games, matrix scaling problems, nonnegative tensors.
The mean-payoff problem
The operator approach. The ergodic equation (additive eigenproblem). When it is solvable, the value of the mean-payoff game does exist, and coincides with the limit value of finite horizon games and discounted games.
Ergodicity notions for Markov decision processes. Bather’s theorem on communicating Markov decision processes.
Extensions of Bather’s theorem to the two-player case. Ergodicity notions for stochastic games. The role of dominions.
Non-ergodic turn-based games. Kohlberg’s theorem on the existence of invariant half-lines.
Blackwell optimal policies (optimal for all discount rates close enough to zero) do exist for turn-based game.
Non-ergodic concurrent games. Existence of the uniform value. Approach by Bewley, Kohlberg, Mertens and Neyman, using the semialgebraic character of the discounted value. Generalization to games definable in o-minimal structures.
Algorithms for mean-payoff games and games with a small discount rate
Value iteration and relative value iteration. Contraction properties in various norms and seminorms. Dobrushin ergodicity coefficient and Birkhoff’s contraction theorem. Example of the stochastic shortest path problem (contraction theorem of Bertsekas and Tsitsiklis). Parameterized complexity bounds for ergodic games.
The theorem of Ye, Hansen, Miltersen and Zwick: policy iteration with a fixed discount factor is strongly polynomial for zero-sum two-player turn-based games.
Reduction of the mean-payoff problem to the discounted problem with small discount rate. Bounds on the Blackwell threshold.
The complexity of concurrent games.
Mean-payoff games and tropical geometry. Embedding mean-payoff games in nonarchimedean linear programs. Link between the complexity of mean-payoff games and the complexity of linear programming