Bandit problems with lévy processes

Asaf Cohen, Eilon Solan

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Lévy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.

Original languageEnglish
Pages (from-to)92-107
Number of pages16
JournalMathematics of Operations Research
Volume38
Issue number1
DOIs
StatePublished - 1 Feb 2013
Externally publishedYes

Keywords

  • Cut-off strategies
  • Lévy processes
  • Two-armed bandit

ASJC Scopus subject areas

  • General Mathematics
  • Computer Science Applications
  • Management Science and Operations Research

Fingerprint

Dive into the research topics of 'Bandit problems with lévy processes'. Together they form a unique fingerprint.

Cite this