Stochastic Multiplayer Games Theory and Algorithms

Michael Ummels received his diploma degree in computer science from RWTH Aachen
University. He started his doctoral studies at the same university in 2006, supervised
by Prof. Dr. Erich Grädel and Prof. Dr. Dr.h.c. Wolfgang Thomas. As of February 2010,
the author is a postdoctoral researcher at ENS Cachan.
Stochastic Multiplayer Games: Theory and Algorithms
Stochastic games provide a versatile model for reactive systems that are
affected by random events. This dissertation advances the algorithmic theory
of stochastic games to incorporate multiple players, whose objectives are not
necessarily conflicting. The basis of this work is a comprehensive complexitytheoretic analysis of the standard game-theoretic solution concepts in the
context of stochastic games over a finite state space. One main result is that
the constrained existence of a Nash equilibrium becomes undecidable in this
setting. This impossibility result is accompanied by several positive results,
including efficient algorithms for natural special cases.
Stochastic Multiplayer Games
Theory and Algorithms
Michael Ummels
Michael Ummels
Stochastic Multiplayer Games
Theory and Algorithms
Typeset by the author in Fedra Serif B using LATEX.
Cover design by Sam Ross-Gower.
ISBN 978 90 8555 040 2
NUR 918
D 82 (Diss. RWTH Aachen University, 2010)
© Michael Ummels, 2010.
Published by Pallas Publications, Amsterdam University Press, Amsterdam.
cbnd
This work is licensed under the Creative Commons Attribution-NonCommercialNoDerivs 3.0 Unported License.
To view a copy of this license, visit http://
creativecommons.org/licenses/by-nc-nd/3.0/, or send a letter to Creative Commons, 171 2nd Street, Suite 300, San Francisco, California, 94105, USA.
Stochastic Multiplayer Games:
Theory and Algorithms
Von der Fakultät für Mathematik, Informatik und
Naturwissenschaften der RWTH Aachen University
zur Erlangung des akademischen Grades eines Doktors
der Naturwissenschaften genehmigte Dissertation
vorgelegt von
Diplom-Informatiker
Michael Ummels
aus Köln
Berichter: Universitätsprofessor Dr. Erich Grädel
Universitätsprofessor Dr. Wolfgang Thomas
Assistant Professor Dr. Marcin Jurdziński
Tag der mündlichen Prüfung: 27. Januar 2010
Diese Dissertation ist auf den Internetseiten der Hochschulbibliothek online verfügbar.
Preface
The last decades have seen an immense amount of research on the algorithmic content of game theory. On the one hand, a new subject called
algorithmic game theory has emerged that is concerned with the study of the
algorithmic theory of finite games with multiple players. On the other hand,
infinite and, in particular, stochastic two-player zero-sum games have become
an important tool for the verification of open systems, which interact with
their environment.
The aim of this work is to bring together algorithmic game theory with
the games that are used in verification by extending the algorithmic theory of stochastic two-player zero-sum games to incorporate multiple players, whose objectives are not necessarily conflicting. In particular, this
work contains a comprehensive study of the complexity of the most prominent solution concepts that are applicable in this setting, namely Nash and
subgame-perfect equilibria.
This book is the result of my doctoral studies at RWTH Aachen University. I am indebted to my primary supervisor Erich Grädel for giving me the
opportunity to pursue these studies, for introducing me to the scientific community and for giving me advice just when I needed it. I am equally grateful
to my secondary supervisor Wolfgang Thomas for his constant support and
encouragement.
Marcin Jurdziński did not hesitate to act as an external reviewer for this
thesis. I thank him not only for his careful reading and numerous remarks,
but also for giving an inspiring talk on branching vector addition systems,
which indirectly led to the resolution of a problem that was left open in the
original version of this thesis.
A substantial part of this book is based on joint work with Dominik Wojtczak. I am indebted to him for our numerous illuminating discussions,
for his insights and ideas, and—last but not least—for hosting me in Edinburgh, Amsterdam and Oxford.
5
Among the various other people who contributed to this work, I would
like to thank in particular Łukasz Kaiser for many enlightening discussions
and for discovering Proposition 3.18. Special thanks also go to Florian Horn
for many interesting discussions, to János Flesch for pointing out Proposition 3.13, and to Peter Bro Miltersen for drawing my attention to Corollary 4.4.
Moreover, I am grateful to Hugo Gimbert and Eilon Solan for answering my
questions and to Rohit Chadha, Tobias Ganzow, Jörg Olschewski and Edeline
Wong for their comments on preliminary drafts of this work.
Finally, I would like to thank Sam Ross-Gower for designing the cover of
this book, and Donald Knuth and Leslie Lamport for creating (LA)TEX.
Paris, November 2010
6
Contents
1 Introduction • 15
1.1 Games and equilibria • 15
1.2 The stochastic dining philosophers problem • 21
1.3 Contributions • 25
1.4 Related work • 27
1.5 Outline • 28
2 Stochastic Games • 31
2.1 Arenas and objectives • 31
2.2 Strategies and strategy profiles • 37
2.3 Subarenas and end components • 41
2.4 Values, determinacy and optimal strategies • 42
2.5 Algorithmic problems • 47
2.6 Existence of residually optimal strategies • 51
3 Equilibria • 55
3.1 Definitions and basic properties • 55
3.2 Existence of Nash equilibria • 59
3.3 Existence of subgame-perfect equilibria • 64
3.4 Computing equilibria • 69
3.5 Decision problems • 73
4 Complexity of Equilibria • 77
4.1 Positional equilibria • 77
4.2 Stationary equilibria • 82
4.3 Pure and randomised equilibria • 88
4.4 Finite-state equilibria • 96
4.5 Summary of results • 98
7
Contents
5 Decidable Fragments • 99
5.1 The strictly qualitative fragment • 99
5.2 The positive-one fragment • 113
5.3 The qualitative fragment for deterministic games • 122
5.4 Summary of results • 133
6 Conclusion • 135
6.1 Summary and open problems • 135
6.2 Perspectives • 138
A Preliminaries • 141
A.1 Probability theory • 141
A.2 Computational complexity • 144
B Markov Chains and Markov Decision Processes • 149
B.1 Markov chains • 149
B.2 Markov decision processes • 152
Bibliography • 157
Notation • 169
Index • 171
8
List of Figures
1.1 Matching pennies as a game in extensive form • 18
1.2 Dining philosophers • 22
1.3 Processes for the stochastic dining philosophers problem • 23
1.4 The stochastic dining philosophers game with two
philosophers • 24
2.1 A hierarchy of prefix-independent objectives • 36
2.2 An example of a two-player SSMG • 38
2.3 An MDP with no optimal strategy • 44
2.4 Another MDP with no optimal strategy • 44
3.1 A two-player reachability game with an irrational Nash
equilibrium • 58
3.2 A two-player game with a pair of optimal strategies that cannot
be extended to a Nash equilibrium • 62
3.3 An SSMG with no stationary Nash equilibrium • 63
3.4 A two-player SSMG with no positional Nash equilibrium • 64
3.5 A Büchi SMG with no subgame-perfect equilibrium • 68
3.6 An SSMG that has a stationary subgame-perfect equilibrium
where player 0 wins almost surely but no pure Nash equilibrium
where player 0 wins with positive probability • 75
3.7 The different decision problems related to Nash and
subgame-perfect equilibria • 76
4.1 Reducing SAT to PosNE, StatNE, PosSPE and StatSPE • 79
4.2 Reducing SqrtSum to StatNE and StatSPE • 85
4.3 Simulating a two-counter machine • 90
4.4 Reducing from the halting problem • 97
9