Scoring-Thresholding pattern based text classifier

  • Authors:
  • Moch Arif Bijaksana;Yuefeng Li;Abdulmohsen Algarni

  • Affiliations:
  • School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, Australia;School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, Australia;School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, Australia

  • Venue:
  • ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A big challenge for classification on text is the noisy of text data. It makes classification quality low. Many classification process can be divided into two sequential steps scoring and threshold setting (thresholding). Therefore to deal with noisy data problem, it is important to describe positive feature effectively scoring and to set a suitable threshold. Most existing text classifiers do not concentrate on these two jobs. In this paper, we propose a novel text classifier with pattern-based scoring that describe positive feature effectively, followed by threshold setting. The thresholding is based on score of training set, make it is simple to implement in other scoring methods. Experiment shows that our pattern-based classifier is promising.