Low-Cost Parallel Text Retrieval Using PC-Cluster

  • Authors:
  • Arnon Rungsawang;A. Laohakanniyom;M. Lertprasertkune

  • Venue:
  • Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2001

Abstract

We present a parallel text retrieval prototype based on the vector space model, implemented on a low-cost PC cluster running the Linux operating system and using the PVM message passing library. The prototype embeds an inverted file structure for fast retrieval. In several experiments on the standard TREC-9 collection, the prototype indexes up to 500,000 web pages per hour using a simple x86 machine. It also achieves a query response time of 5.4 seconds when searching the one and a half million TREC-9 web pages using two machines.
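
The abstract does not give implementation details, so the following is only a minimal Python sketch of the general idea: vector-space scoring over an inverted file, with the collection split into partitions that could each live on one cluster node. It is not the authors' PVM code. The function names, the unnormalized TF-IDF dot-product scoring, and the in-process loop standing in for message passing are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build an inverted file for one partition: term -> [(doc_id, tf), ...]."""
    postings = defaultdict(list)
    for doc_id, text in docs.items():
        for term, tf in Counter(text.lower().split()).items():
            postings[term].append((doc_id, tf))
    return dict(postings)


def global_idf(partitions, n_docs):
    """Combine per-partition document frequencies into collection-wide IDF."""
    df = Counter()
    for postings in partitions:
        for term, plist in postings.items():
            df[term] += len(plist)
    return {term: math.log(n_docs / count) for term, count in df.items()}


def search_partition(postings, query, idf):
    """Score only documents that share a term with the query.

    Document weight is tf * idf and query weight is idf, so each matching
    term adds tf * idf^2 (an unnormalized dot product, assumed here).
    """
    scores = defaultdict(float)
    for term in set(query.lower().split()):
        for doc_id, tf in postings.get(term, []):
            scores[doc_id] += tf * idf.get(term, 0.0) ** 2
    return dict(scores)


def search(partitions, query, idf, k=3):
    """Query every partition and merge the partial results.

    On a real cluster each partition would be searched on its own node and
    the results sent back to a coordinator; here the loop is sequential.
    """
    merged = {}
    for postings in partitions:
        merged.update(search_partition(postings, query, idf))
    return sorted(merged.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


if __name__ == "__main__":
    # Hypothetical toy collection split across two "nodes".
    node_a = {"d1": "parallel text retrieval", "d2": "linux cluster"}
    node_b = {"d3": "text retrieval on a pc cluster", "d4": "message passing"}
    partitions = [build_index(node_a), build_index(node_b)]
    idf = global_idf(partitions, n_docs=4)
    for doc_id, score in search(partitions, "parallel retrieval", idf):
        print(doc_id, round(score, 3))
```

Because the inverted file maps each term directly to the documents containing it, a query touches only the posting lists for its own terms rather than every document. Partitioning by document keeps each node's index independent, so the only shared state in this sketch is the collection-wide IDF table.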