Combining Multiple Feature Selection Methods for Text Categorization by Using Rank-Score Characteristics

  • Authors:
  • Yanjun Li;D. Frank Hsu;Soon M. Chung

  • Affiliations:
  • -;-;-

  • Venue:
  • ICTAI '09 Proceedings of the 2009 21st IEEE International Conference on Tools with Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Feature selection is an important method for improving the efficiency and accuracy of text categorization algorithmsby removing redundant and irrelevant terms from the corpus.Extensive researches have been done to improve the performance ofindividual feature selection methods, but not much on their combinations.In this paper, we propose a method of combining multiple feature selection methods by using the Combinatorial Fusion Analysis (CFA). A rank-score function and its graph, called rank-score graph,are adopted to measure the diversity of different feature selection methods.We have shown that a combination of multiple feature selection methods can outperform a single method only if each individual feature selection method has unique scoring behavior and relatively high performance. Moreover, it is shown that the rank-score function and rank-score graph are useful for the selection of a combination of feature selection methods.