CCM: A Text Classification Model by Clustering

  • Authors:
  • Sarwat Nizamani;Nasrullah Memon;Uffe Kock Wiil;Panagiotis Karampelas

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ASONAM '11 Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks, is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based approach to text classification tasks simplifies the model and at the same time increases the accuracy.