An empirical study of the noise impact on cost-sensitive learning

  • Authors:
  • Xingquan Zhu;Xindong Wu;Taghi M. Khoshgoftaar;Yong Shi

  • Affiliations:
  • Dept. of Computer Science & Eng., Florida Atlantic University, Boca Raton, FL and Graduate University, Chinese Academy of Sciences, Beijing, China;Department of Computer Science, University of Vermont, Burlington, VT;Dept. of Computer Science & Eng., Florida Atlantic University, Boca Raton, FL;Graduate University, Chinese Academy of Sciences, Beijing, China

  • Venue:
  • IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we perform an empirical study of the impact of noise on cost-sensitive (CS) learning, through observations on how a CS learner reacts to the mislabeled training examples in terms of misclassification cost and classification accuracy. Our empirical results and theoretical analysis indicate that mislabeled training examples can raise serious concerns for cost-sensitive classification, especially when misclassifying some classes becomes extremely expensive. Compared to general inductive learning, the problem of noise handling and data cleansing is more crucial, and should be carefully investigated to ensure the success of CS learning.