Using Laplace and angular measures for Feature Selection in Text Categorisation

  • Authors:
  • Elena Montanes;Pedro Alonso;Elias F. Combarro;Irene Diaz;Raquel Cortina;Jose Ranilla

  • Affiliations:
  • Computer Science Department, University of Oviedo, Spain.;Mathematics Department, University of Oviedo, Spain.;Computer Science Department, University of Oviedo, Spain.;Computer Science Department, University of Oviedo, Spain.;Computer Science Department, University of Oviedo, Spain.;Computer Science Department, University of Oviedo, Spain

  • Venue:
  • International Journal of Advanced Intelligence Paradigms
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text Categorisation (TC) consists of automatically assigning documents to a set of prefixed categories. It usually involves the management of a huge number of features. Some of them are irrelevant or noisy which mislead the classifiers. Thus, they are reduced to increase the efficiency and effectiveness of the classification. In this paper we propose to select relevant features using two different families of filtering measures, which are simpler than other usual measures applied for this purpose. The experiments over three corpora show that, in general, the proposed measures perform equal or better than the existing ones, sometimes allowing greater reductions.