Bandit Problems with Lévy Processes

  • Authors:
  • Asaf Cohen;Eilon Solan

  • Affiliations:
  • The School of Mathematical Sciences, Tel Aviv University, Tel Aviv 69978, Israel;The School of Mathematical Sciences, Tel Aviv University, Tel Aviv 69978, Israel

  • Venue:
  • Mathematics of Operations Research
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Lévy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.