Finite dimensional algorithms for the hidden Markov model multi-armed bandit problem

  • Authors:
  • V. Krishnamurthy; J. Mickova

  • Affiliations:
  • Dept. of Electr. Eng., Melbourne Univ., Parkville, Vic., Australia;-

  • Venue:
  • ICASSP '99: Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 05
  • Year:
  • 1999

Abstract

The multi-armed bandit problem is widely used in the scheduling of traffic in broadband networks, manufacturing systems, and robotics. This paper presents a finite dimensional optimal solution to the multi-armed bandit problem for hidden Markov models. The key to solving any multi-armed bandit problem is to compute the Gittins (1979, 1989) index. In this paper, a finite dimensional algorithm is presented which exactly computes the Gittins index. Suboptimal algorithms for computing the Gittins index are also presented and experimentally shown to perform almost as well as the optimal method. Finally, an application of the algorithms to tracking multiple targets with a single intelligent sensor is presented.
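
For readers unfamiliar with the Gittins index, the following minimal sketch may help make the idea concrete. It is not the algorithm from this paper: it assumes a single arm whose state is a fully observed finite-state Markov chain (not a hidden Markov model), and it uses the standard restart-in-state formulation solved by value iteration. The function name `gittins_indices`, the transition matrix `P`, the reward vector `r`, and the discount factor `beta` are all hypothetical names chosen for illustration.

```python
import numpy as np


def gittins_indices(P, r, beta, tol=1e-10, max_iter=100_000):
    """Gittins index of every state of one fully observed Markov arm.

    Assumption: P is an n x n row-stochastic matrix, r holds the reward
    earned in each state, and 0 < beta < 1 is the discount factor.
    Uses the restart formulation: for each state i, solve the problem in
    which the player may either continue from the current state or
    restart from state i, then scale the value at i by (1 - beta).
    """
    P = np.asarray(P, dtype=float)
    r = np.asarray(r, dtype=float)
    n = len(r)
    indices = np.empty(n)
    for i in range(n):
        V = np.zeros(n)
        for _ in range(max_iter):
            cont = r + beta * (P @ V)   # value of playing from each state
            restart = cont[i]           # value of restarting in state i
            V_new = np.maximum(cont, restart)
            converged = np.max(np.abs(V_new - V)) < tol
            V = V_new
            if converged:
                break
        indices[i] = (1.0 - beta) * V[i]
    return indices


if __name__ == "__main__":
    # Illustrative two-state arm; the numbers are made up for the example.
    P = [[0.5, 0.5],
         [0.2, 0.8]]
    r = [1.0, 0.0]
    print(gittins_indices(P, r, beta=0.9))
```

With several independent arms, the index policy plays, at each step, the arm whose current state has the largest index. In the hidden Markov model setting of the paper the arm state is not observed directly, so this fully observed sketch does not apply as written.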