Active learning for probability estimation using Jensen-Shannon divergence

  • Authors:
  • Prem Melville; Stewart M. Yang; Maytal Saar-Tsechansky; Raymond Mooney

  • Affiliations:
  • Dept. of Computer Sciences, Univ. of Texas at Austin; Dept. of Computer Sciences, Univ. of Texas at Austin; Red McCombs School of Business, Univ. of Texas at Austin; Dept. of Computer Sciences, Univ. of Texas at Austin

  • Venue:
  • ECML'05: Proceedings of the 16th European Conference on Machine Learning
  • Year:
  • 2005

Abstract

Active selection of good training examples is an important approach to reducing data-collection costs in machine learning; however, most existing methods focus on maximizing classification accuracy. In many applications, such as those with unequal misclassification costs, producing good class probability estimates (CPEs) is more important than optimizing classification accuracy. We introduce novel approaches to active learning based on the algorithms Bootstrap-LV and ActiveDecorate, by using Jensen-Shannon divergence (a similarity measure for probability distributions) to improve sample selection for optimizing CPEs. Comprehensive experimental results demonstrate the benefits of our approaches.
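The core selection signal described above can be illustrated with a short sketch: Jensen-Shannon divergence measures disagreement among the class probability estimates produced by a committee of models, and examples with high disagreement are candidates for labeling. The function names (`js_divergence`, `select_examples`) and the greedy top-k selection below are illustrative assumptions, not the paper's exact sampling scheme; the methods based on Bootstrap-LV and ActiveDecorate may weight or sample examples differently.

```python
import numpy as np

def entropy(p, eps=1e-12):
    # Shannon entropy (base 2) along the last axis of a probability array.
    p = np.clip(p, eps, 1.0)
    return -np.sum(p * np.log2(p), axis=-1)

def js_divergence(dists):
    # dists: (n_members, n_classes) array, one class probability estimate
    # per committee member. JS divergence = entropy of the mean distribution
    # minus the mean of the members' entropies (equal member weights assumed).
    dists = np.asarray(dists, dtype=float)
    mean_dist = dists.mean(axis=0)
    return entropy(mean_dist) - entropy(dists).mean()

def select_examples(ensemble_probs, k):
    # ensemble_probs: (n_unlabeled, n_members, n_classes) predictions for the
    # unlabeled pool. Returns indices of the k highest-disagreement examples
    # (a simple greedy stand-in for the paper's selection strategy).
    scores = np.array([js_divergence(p) for p in ensemble_probs])
    return np.argsort(scores)[::-1][:k]
```

In this sketch, an example on which the committee members' probability estimates agree closely gets a score near zero, while strongly conflicting estimates yield a high score and are selected for labeling first.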