Document filtering method using non-relevant information profile

  • Authors:
  • Keiichiro Hoashi;Kazunori Matsumoto;Naomi Inoue;Kazuo Hashimoto

  • Affiliations:
  • KDD R&D Laboratories, Inc., 2-1-15 Ohara Kamifukuoka, Saitama 356-8502 Japan;KDD R&D Laboratories, Inc., 2-1-15 Ohara Kamifukuoka, Saitama 356-8502 Japan;KDD R&D Laboratories, Inc., 2-1-15 Ohara Kamifukuoka, Saitama 356-8502 Japan;KDD R&D Laboratories, Inc., 2-1-15 Ohara Kamifukuoka, Saitama 356-8502 Japan

  • Venue:
  • SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2000

Abstract

Document filtering is the task of retrieving documents relevant to a user's profile from a stream of incoming documents. Filtering systems generally calculate the similarity between the profile and each incoming document, and retrieve the documents whose similarity exceeds a threshold. However, many systems set a relatively high threshold to reduce the retrieval of non-relevant documents, which causes many relevant documents to be missed. In this paper, we propose the use of a non-relevant information profile to reduce the mistaken retrieval of non-relevant documents. Experimental results show that this filter successfully rejects a sufficient number of non-relevant documents, resulting in improved filtering performance.
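
The two-stage idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the document representation, the similarity measure, how either profile is built, or any threshold values. The sketch assumes bag-of-words term-frequency vectors, cosine similarity, and two hypothetical thresholds (`rel_threshold`, `nonrel_threshold`); the function names and the example texts are likewise invented for illustration.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term-frequency vector (an assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def accept(doc, profile, nonrel_profile,
           rel_threshold=0.15, nonrel_threshold=0.5):
    """Return True if the document should be retrieved.

    Stage 1: the document must be similar enough to the user's profile.
    Stage 2: the document must NOT be too similar to the non-relevant
    profile. Both threshold values are hypothetical.
    """
    d = vectorize(doc)
    if cosine(d, profile) <= rel_threshold:
        return False
    return cosine(d, nonrel_profile) < nonrel_threshold


# Hypothetical profiles: the user wants documents about the car,
# and documents about the animal were previously judged non-relevant.
profile = vectorize("jaguar car engine speed")
nonrel_profile = vectorize("jaguar cat jungle animal")

car_doc = "new jaguar car has a powerful engine"
cat_doc = "the jaguar is a jungle cat"
other_doc = "stock market falls"

for doc in (car_doc, cat_doc, other_doc):
    print(doc, "->", accept(doc, profile, nonrel_profile))
```

In this toy setup, a lower relevance threshold lets the animal document pass the first stage because it shares the term "jaguar" with the profile, and the non-relevant profile is what rejects it. This mirrors the motivation in the abstract: the second profile removes false positives that would otherwise force the relevance threshold upward.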