Training classifiers for unbalanced distribution and cost-sensitive domains with ROC analysis

  • Authors:
  • Xiaolong Zhang;Chuan Jiang;Ming-jian Luo

  • Affiliations:
  • School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China;School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China;School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China

  • Venue:
  • PKAW'06 Proceedings of the 9th Pacific Rim Knowledge Acquisition international conference on Advances in Knowledge Acquisition and Management
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

ROC (Receiver Operating Characteristic) has been used as a tool for the analysis and evaluation of two-class classifiers, even the training data embraces unbalanced class distribution and cost-sensitiveness. However, ROC has not been effectively extended to evaluate multi-class classifiers. In this paper, we proposed an effective way to deal with multi-class learning with ROC analysis. An EMAUC algorithm is implemented to transform a multi-class training set into several two-class training sets. Classification is carried out with these two-class training sets. Empirical results demonstrate that the classifiers trained with the proposed algorithm have competitive performance for unbalanced distribution and cost-sensitive domains.