Making online decisions with bounded memory

  • Authors:
  • Chi-Jen Lu;Wei-Fu Lu

  • Affiliations:
  • Institute of Information Science, Academia Sinica, Taipei, Taiwan;Department of Computer Science and Information Engineering, Asia University, Taiwan

  • Venue:
  • ALT'11 Proceedings of the 22nd international conference on Algorithmic learning theory
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study the online decision problem in which there are T steps to play and n actions to choose. For this problem, several algorithms achieve an optimal regret of O(√T ln n), but they all require about Tn states, which one may not be able to afford when n and T are very large. We are interested in such large scale problems, and we would like to understand what an online algorithm can achieve with only a bounded number of states. We provide two algorithms, both with mn-1 states, for a parameter m, which achieve regret of O(m + (T/m) ln (mn)) and O(n√m+T/√m), respectively. We also show that any online algorithm with mn-1 states must suffer a regret of Ω(T/m), which is close to what our algorithms achieve.