A framework for a feedback process to analyze and personalize a document vector space in a feature extraction model

  • Authors:
  • Kosuke Takano;Xing Chen;Keisuke Masuda

  • Affiliations:
  • Department of Information and Computer Sciences, Faculty of Information Technology, Kanagawa Institute of Technology, Atsugi, Japan 243-0292;Department of Information and Computer Sciences, Faculty of Information Technology, Kanagawa Institute of Technology, Atsugi, Japan 243-0292;Graduate School of Information and Computer Sciences, Kanagawa Institute of Technology, Atsugi, Japan 243-0292

  • Venue:
  • Information Technology and Management
  • Year:
  • 2009

Abstract

In this paper, we present a framework for a feedback process that supports highly accurate document retrieval. The system creates a document vector space dynamically and performs retrieval within it, so retrieval accuracy depends on how that space is constructed. When the vector space is created for a user's specific purpose and interests, highly accurate retrieval results can be obtained. We therefore present a method for analyzing and personalizing the vector space according to the purposes and interests of users. To optimize the document vector space, we defined and implemented functions for adding, deleting, and weighting the terms used to create it. By dynamically exploiting classified-document information related to the queries, our method allows users to retrieve documents relevant to their interests and purposes. Even when the results from the initial retrieval space are not appropriate, applying the proposed feedback operations improves them. We also implemented an experimental search system for semantic document retrieval. Several experimental results, including comparisons of our method with the traditional relevance feedback method, are presented to clarify how the feedback process improved retrieval accuracy and how accurately documents satisfying the purposes and interests of users were extracted.
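
The three feedback operations named in the abstract (adding, deleting, and weighting terms) can be illustrated with a minimal sketch. It is not the authors' implementation: the class `TermSpace`, its method names, the term-count vectors, the cosine ranking, and the toy documents are all assumptions chosen to show how editing the terms that span the space can change a ranking.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TermSpace:
    """Hypothetical term-based vector space (illustrative, not from the paper)."""

    def __init__(self, terms):
        # Each term is one axis of the space; every axis starts with weight 1.0.
        self.weights = {term: 1.0 for term in terms}

    def add_term(self, term, weight=1.0):
        self.weights[term] = weight

    def delete_term(self, term):
        self.weights.pop(term, None)

    def weight_term(self, term, weight):
        if term in self.weights:
            self.weights[term] = weight

    def vectorize(self, text):
        # Weighted term counts, one component per axis of the current space.
        counts = Counter(text.lower().split())
        return [w * counts[t] for t, w in self.weights.items()]

    def rank(self, query, docs):
        # Documents sorted by descending cosine similarity to the query.
        q = self.vectorize(query)
        scored = [(cosine(q, self.vectorize(d)), d) for d in docs]
        return [d for _, d in sorted(scored, key=lambda p: p[0], reverse=True)]


# Toy data (assumed): the user is interested in the car, not the animal.
car_doc = "jaguar car engine review"
animal_doc = "jaguar jaguar habitat"
docs = [car_doc, animal_doc]
query = "jaguar engine"

space = TermSpace(["jaguar", "habitat", "review"])
before = space.rank(query, docs)   # initial space has no axis for "engine"

# Feedback: drop a noisy term, add a term matching the interest, emphasize it.
space.delete_term("review")
space.add_term("engine")
space.weight_term("engine", 2.0)
after = space.rank(query, docs)
```

In this toy setup the initial space cannot see the term "engine", so the animal document ranks first; after the three feedback operations the car document moves to the top. The actual system described in the paper also uses classified-document information, which this sketch leaves out.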