Adaptive regularization of weight vectors

  • Authors:
  • Koby Crammer;Alex Kulesza;Mark Dredze

  • Affiliations:
  • Department of Electrical Engineering, The Technion, Haifa, Israel 32000;Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, USA 48109;Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, USA 21211

  • Venue:
  • Machine Learning
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present AROW, an online learning algorithm for binary and multiclass problems that combines large margin training, confidence weighting, and the capacity to handle non-separable data. AROW performs adaptive regularization of the prediction function upon seeing each new instance, allowing it to perform especially well in the presence of label noise. We derive mistake bounds for the binary and multiclass settings that are similar in form to the second order perceptron bound. Our bounds do not assume separability. We also relate our algorithm to recent confidence-weighted online learning techniques. Empirical evaluations show that AROW achieves state-of-the-art performance on a wide range of binary and multiclass tasks, as well as robustness in the face of non-separable data.