Tracking Drifting Concepts By Minimizing Disagreements

  • Authors:
  • David P. Helmbold;Philip M. Long

  • Affiliations:
  • CIS board, UC Santa Cruz, Santa Cruz, CA 95064. DPH@CSE.UCSC.EDU;Institute for Theoretical Computer Science, Technische Universitaet Graz, Klosterwiesgasse 32/2, A-8010 Graz, Austria. PLONG@IGI.TU-GRAZ.AC.AT

  • Venue:
  • Machine Learning - Special issue on computational learning theory
  • Year:
  • 1994

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we consider the problem of tracking a subset of a domain (called the target) which changes gradually over time. A single (unknown) probability distribution over the domain is used to generate random examples for the learning algorithm and measure the speed at which the target changes. Clearly, the more rapidly the target moves, the harder it is for the algorithm to maintain a good approximation of the target. Therefore we evaluate algorithms based on how much movement of the target can be tolerated between examples while predicting with accuracy ε Furthermore, the complexity of the class {\CAL H} of possible targets, as measured by d, its VC-dimension, also effects the difficulty of tracking the target concept. We show that if the problem of minimizing the number of disagreements with a sample from among concepts in a class {\CAL H} can be approximated to within a factor k, then there is a simple tracking algorithm for {\CAL H} which can achieve a probability ε of making a mistake if the target movement rate is at most a constant times epsilon^2/(k(d+k){\rm ln}{1\over \epsilon}), where d is the Vapnik-Chervonenkis dimension of {\CAL H}. Also, we show that if {\CAL H} is properly PAC-learnable, then there is an efficient (randomized) algorithm that with high probability approximately minimizes disagreements to within a factor of 7d + 1, yielding an efficient tracking algorithm for {\CAL H} which tolerates drift rates up to a constant times \epsilon^2/(d^2{\rm ln}{1\over \epsilon}). In addition, we prove complementary results for the classes of halfspaces and axis-aligned hyperrectangles showing that the maximum rate of drift that any algorithm (even with unlimited computational power) can tolerate is a constant times ε2/d.