Abstract
We introduce and study a two-player zero-sum game between a probabilist and Nature defined by a convex function f, a finite collection B of Markov generators (or its convex hull), and a target distribution π. The probabilist selects a mixed strategy µ ∈ P(B), the set of probability measures on B, while Nature adopts a pure strategy and selects a π-reversible Markov generator M. The probabilist receives a payoff equal to the f-divergence Df(M∥L), where L is drawn according to µ. We prove that this game always admits a mixed strategy Nash equilibrium and satisfies a minimax identity. In contrast, a pure strategy equilibrium may fail to exist. We develop a projected subgradient method to compute approximate mixed strategy equilibria with provable convergence guarantees. Connections to information centroids, Chebyshev centers, and Bayes risk are discussed. This paper extends earlier minimax results on f-divergences such as (Haussler, 1997; Haussler and Opper, 1997; Gushchin and Zhdanov, 2006) to the context of Markov generators.
| Original language | English |
|---|---|
| Pages (from-to) | 925-952 |
| Number of pages | 28 |
| Journal | Alea |
| Volume | 22 |
| DOIs | |
| State | Published - 1 Jan 2025 |
| Externally published | Yes |
Keywords
- 49J35
- 60J27
- 60J28
- 62B10
- 62C20
- 90C47
- 91A05
- 91A68
- 94A17
- 94A29
- Bayes risk
- Chebyshev center
- Markov chains
- Nash equilibrium
- algorithmic game theory
- f-divergence
- information centroid
- information geometry
- minimax theorem
- saddle point
- subgradient
ASJC Scopus subject areas
- Statistics and Probability
Fingerprint
Dive into the research topics of 'Markov chain entropy games and the geometry of their Nash equilibria'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver