Ensemble of binary learners for reliable text categorization with a reject option

  • Authors:
  • Giuliano Armano;Camelia Chira;Nima Hatami

  • Affiliations:
  • Department of Electrical and Electronic Engineering, University of Cagliari, Cagliari, Italy;Department of Computer Science, Babes-Bolyai University, Cluj-Napoca, Romania;Department of Electrical and Electronic Engineering, University of Cagliari, Cagliari, Italy

  • Venue:
  • HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text categorization is a key task in information retrieval and natural language processing. Providing a reliability measure of the classification result for a text document into a particular category can benefit the recognition rate as well as better inform the user with regard to the confidence that should be attributed to the output. A novel reliability measure is proposed starting from running different binary classifiers in the Error-Correcting Output Codes (ECOC) framework. Documents classified in a particular category which have a higher ECOC-computed distance from their classification in the next ranked category also have a higher associated reliability. This is the main idea explored in the proposed ECOC-based text classifier with a reject option. Experiments performed for some commonly used text categorization benchmark datasets demonstrate the potential of the proposed method.