Data allocation scheme based on term weight for P2P information retrieval

  • Authors:
  • Hisashi Kurasawa;Hiromi Wakaki;Atsuhiro Takasu;Jun Adachi

  • Affiliations:
  • The University of Tokyo, Tokyo, Japan;Toshiba R & D Center, Yokohama, Japan;National Institute of Informatics, Tokyo, Japan;National Institute of Informatics, Tokyo, Japan

  • Venue:
  • Proceedings of the 9th annual ACM international workshop on Web information and data management
  • Year:
  • 2007

Abstract

Many peer-to-peer (P2P) information retrieval systems that use a global index to retrieve documents relevant to a query have already been proposed. Because these systems allocate documents to peers without regard to the queries that will retrieve them, a query must contact many peers to gather the relevant documents. We propose Concordia, a new data allocation scheme for P2P information retrieval. In Concordia, a node allocates each document based on the weight of each term in the document, so that all the documents relevant to a query can be assembled efficiently from the P2P network. Moreover, the node encodes the binary data of each document with an erasure code, which gives Concordia efficient redundancy against node failures.
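
The abstract does not give Concordia's actual allocation rule, so the following is only a minimal sketch of the general idea of placing a document according to its term weights. Everything in it is an assumption made for illustration: relative term frequency stands in for the term weight, a SHA-1 hash modulo a fixed peer count stands in for the P2P overlay, and the names `term_weights`, `peer_for_term`, `allocate`, `lookup`, `NUM_PEERS`, and `TOP_K` are hypothetical. The erasure coding step is not shown.

```python
import hashlib
from collections import Counter

NUM_PEERS = 8  # hypothetical network size (assumption)
TOP_K = 2      # hypothetical number of high-weight terms per document (assumption)


def term_weights(text):
    """Relative term frequency, used here as a stand-in term weight (assumption)."""
    terms = text.lower().split()
    counts = Counter(terms)
    total = len(terms)
    return {term: count / total for term, count in counts.items()}


def peer_for_term(term, num_peers=NUM_PEERS):
    """Map a term to a peer id with a stable hash (a DHT-like stand-in)."""
    digest = hashlib.sha1(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_peers


def allocate(doc_id, text, store, top_k=TOP_K):
    """Place the document on the peers responsible for its top-k weighted terms."""
    weights = term_weights(text)
    # Sort by descending weight, then alphabetically so ties are deterministic.
    top_terms = sorted(weights, key=lambda t: (-weights[t], t))[:top_k]
    for term in top_terms:
        store.setdefault(peer_for_term(term), {}).setdefault(term, set()).add(doc_id)
    return top_terms


def lookup(term, store):
    """A single-term query contacts only the one peer responsible for that term."""
    return store.get(peer_for_term(term), {}).get(term, set())


store = {}  # peer id -> {term -> set of document ids}
allocate("d1", "peer peer retrieval index", store)
allocate("d2", "erasure peer erasure peer code", store)
print(sorted(lookup("peer", store)))  # ['d1', 'd2']
print(sorted(lookup("code", store)))  # []
```

In this sketch a query for "peer" reaches both documents through a single peer, while "code" returns nothing because it is a low-weight term in `d2` and was never used for placement. That trade-off between contacting fewer peers and covering low-weight terms is a property of this simplified stand-in, not a claim about Concordia itself.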