On-line learning with delayed label feedback

  • Authors:
  • Chris Mesterharm

  • Affiliations:
  • Rutgers Computer Science Department, Piscataway, NJ

  • Venue:
  • ALT'05 Proceedings of the 16th international conference on Algorithmic Learning Theory
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We generalize on-line learning to handle delays in receiving labels for instances. After receiving an instance x, the algorithm may need to make predictions on several new instances before the label for x is returned by the environment. We give two simple techniques for converting a traditional on-line algorithm into an algorithm for solving a delayed on-line problem. One technique is for instances generated by an adversary; the other is for instances generated by a distribution. We show how these techniques effect the original on-line mistake bounds by giving upper-bounds and restricted lower-bounds on the number of mistakes.