Using the αβ-Neighborhood for Adaptive Document Filtering

  • Authors:
  • Adrian Fonseca-Bruzón;Reynaldo Gil-García;Aurora Pons-Porrata

  • Affiliations:
  • Center for Pattern Recognition and Data Mining, Universidad de Oriente, Santiago de Cuba, Cuba (all authors)

  • Venue:
  • CIARP '08 Proceedings of the 13th Iberoamerican Congress on Pattern Recognition: Progress in Pattern Recognition, Image Analysis and Applications
  • Year:
  • 2008

Abstract

In this paper, we address the problem of adaptive document filtering. Traditionally, a user profile is represented by the centroid of the available examples, on the assumption that the examples are homogeneously distributed around it. However, the examples may be irregularly distributed, with some areas more densely populated than others. In that case the homogeneity assumption may not hold globally, yet it may still hold locally. To handle this, we introduce an approach that uses a binary classifier for each user profile and considers more than one document in the classification task. To decide whether a new document is relevant to the user, our approach uses a Nearest Neighbor classifier based on a neighborhood that inspects a sufficiently small area surrounding the new document. Experiments carried out on the TREC-11 collection show the effectiveness of the proposed method.
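
The local decision described in the abstract can be illustrated with a minimal sketch. The abstract does not define the αβ-neighborhood, so the code below substitutes a plain similarity-threshold neighborhood as a stand-in; it is not the authors' method. The names `cosine`, `is_relevant`, and `radius`, the sparse dictionary representation of documents, and the similarity-weighted vote are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def is_relevant(doc, examples, radius=0.5):
    """Decide relevance from the labeled examples close to `doc`.

    `examples` is a list of (vector, label) pairs, label True = relevant.
    `radius` is a hypothetical similarity threshold standing in for the
    paper's neighborhood definition, which the abstract does not give.
    """
    # Keep only examples inside the small area around the new document.
    near = []
    for vec, label in examples:
        sim = cosine(doc, vec)
        if sim >= radius:
            near.append((sim, label))
    if not near:
        return False  # no local evidence: treat as not relevant
    # Similarity-weighted vote between relevant and non-relevant neighbors.
    pos = sum(sim for sim, label in near if label)
    neg = sum(sim for sim, label in near if not label)
    return pos > neg
```

Unlike a single centroid per profile, this decision depends only on the examples that fall near the incoming document, so a sparsely populated region of the profile does not shift the outcome for a document in a dense region.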