Skip to main navigation Skip to search Skip to main content

Markov chain entropy games and the geometry of their Nash equilibria

  • Michael C.H. Choi
  • , Geoffrey Wolfer

Research output: Contribution to journalArticlepeer-review

Abstract

We introduce and study a two-player zero-sum game between a probabilist and Nature defined by a convex function f, a finite collection B of Markov generators (or its convex hull), and a target distribution π. The probabilist selects a mixed strategy µ ∈ P(B), the set of probability measures on B, while Nature adopts a pure strategy and selects a π-reversible Markov generator M. The probabilist receives a payoff equal to the f-divergence Df(M∥L), where L is drawn according to µ. We prove that this game always admits a mixed strategy Nash equilibrium and satisfies a minimax identity. In contrast, a pure strategy equilibrium may fail to exist. We develop a projected subgradient method to compute approximate mixed strategy equilibria with provable convergence guarantees. Connections to information centroids, Chebyshev centers, and Bayes risk are discussed. This paper extends earlier minimax results on f-divergences such as (Haussler, 1997; Haussler and Opper, 1997; Gushchin and Zhdanov, 2006) to the context of Markov generators.

Original languageEnglish
Pages (from-to)925-952
Number of pages28
JournalAlea
Volume22
DOIs
StatePublished - 1 Jan 2025
Externally publishedYes

Keywords

  • 49J35
  • 60J27
  • 60J28
  • 62B10
  • 62C20
  • 90C47
  • 91A05
  • 91A68
  • 94A17
  • 94A29
  • Bayes risk
  • Chebyshev center
  • Markov chains
  • Nash equilibrium
  • algorithmic game theory
  • f-divergence
  • information centroid
  • information geometry
  • minimax theorem
  • saddle point
  • subgradient

ASJC Scopus subject areas

  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Markov chain entropy games and the geometry of their Nash equilibria'. Together they form a unique fingerprint.

Cite this