Finding optimal linear measures for feature selection in text categorization

  • Authors:
  • Elena Montañés;E. F. Combarro;José Ranilla;Irene Díaz

  • Affiliations:
  • University of Oviedo, Gijón, Spain;University of Oviedo, Gijón, Spain;University of Oviedo, Gijón, Spain;University of Oviedo

  • Venue:
  • Proceedings of the 2006 ACM symposium on Applied computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

A common way of performing Feature Selection in Text Categorization consists in keeping the features with highest score according to certain measures, like linear ones which have been successfully proposed in [1]. Its disadvantage is that they need to previously determine the parameter which defines them. Until now, this drawback has been overcome by taking manually a set of values for such parameter. This paper proposes a method for automatically determining optimal values of the parameter by means of solving a univariate maximization problem.