High Relevance Keyword Extraction facility for Bayesian text classification on different domains of varying characteristic

  • Authors:
  • Lam Hong Lee;Dino Isa;Wou Onn Choo;Wen Yeen Chue

  • Affiliations:
  • Faculty of Information and Communication Technology - Perak Campus, Universiti Tunku Abdul Rahman, Bandar Barat, 31900 Kampar, Perak, Malaysia;Intelligent Systems Research Group, Faculty of Engineering, The University of Nottingham, Malaysia Campus, Jalan Broga, 43500 Semenyih, Selangor, Malaysia;Faculty of Information and Communication Technology - Perak Campus, Universiti Tunku Abdul Rahman, Bandar Barat, 31900 Kampar, Perak, Malaysia;Faculty of Business and Finance, Universiti Tunku Abdul Rahman, Bandar Barat, 31900 Kampar, Perak, Malaysia

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

High Relevance Keyword Extraction (HRKE) facility is introduced to Bayesian text classification to perform feature/keyword extraction during the classifying stage, without needing extensive pre-classification processes. In order to perform the task of keyword extraction, HRKE facility uses the posterior probability value of keywords within a specific category associated with text document. The experimental results show that HRKE facility is able to ensure promising classification performance for Bayesian classifier while dealing with different text classification domains of varying characteristics. This method guarantees an effective and efficient Bayesian text classifier which is able to handle different domains of varying characteristics, with high accuracy while maintaining the simplicity and low cost processes of the conventional Bayesian classification approach.