A new nearest neighbor rule for text categorization

  • Authors:
  • Reynaldo Gil-García;Aurora Pons-Porrata

  • Affiliations:
  • Center of Pattern Recognition and Data Mining, Universidad de Oriente, Santiago de Cuba, Cuba;Center of Pattern Recognition and Data Mining, Universidad de Oriente, Santiago de Cuba, Cuba

  • Venue:
  • CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The nearest neighbor (NN) rule is usually chosen in a large number of pattern recognition systems due to its simplicity and good properties. In particular, this rule has been successfully applied to text categorization. A vast number of NN algorithms have been developed during the last years. They differ in how they find the nearest neighbors, how they obtain the votes of categories, and which decision rule they use. A new NN classification rule which comes from the use of a different definition of neighborhood is introduced in this paper. The experimental results on Reuters-21578 standard benchmark collection show that our algorithm achieves better classification rates than the k-NN rule while decreasing classification time.