Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values

  • Authors:
  • Shing-Hwa Lu;Ding-An Chiang;Huan-Chao Keh;Hui-Hua Huang

  • Affiliations:
  • Department of Urology, School of Medicine, National Yang-Ming University, Taiwan and Department of Urology, Taipei City Hospital, Taipei, Taiwan;Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan;Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan;Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan

  • Venue:
  • Knowledge-Based Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Each type of classifier has its own advantages as well as certain shortcomings. In this paper, we take the advantages of the associative classifier and the Naive Bayes Classifier to make up the shortcomings of each other, thus improving the accuracy of text classification. We will classify the training cases with the Naive Bayes Classifier and set different confidence threshold values for different class association rules (CARs) to different classes by the obtained classification accuracy rate of the Naive Bayes Classifier to the classes. Since the accuracy rates of all selected CARs of the class are higher than that obtained by the Naive Bayes Classifier, we could further optimize the classification result through these selected CARs. Moreover, for those unclassified cases, we will classify them with the Naive Bayes Classifier. The experimental results show that combining the advantages of these two different classifiers better classification result can be obtained than with a single classifier.