Parallel DSIR Text Indexing System: Using Multiple Master/Slave Concept

  • Authors:
  • Pawat Laohawee;Athichat Tangpong;Arnon Rungsawang

  • Affiliations:
  • -;-;-

  • Venue:
  • Proceedings of the 7th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present another study of an improved parallel DSIR text retrieval system that can perform fast indexing of several gigabytes of text collection using Pentium-class PC-cluster. We use multiple-master/slave principle to implement this parallel indexing algorithm. In a computing node, a master process has been designed to work in conjunction with each slave process in order to utilize as much as possible the computing power during one or another process is waiting for I/O. We also present special buffering and caching techniques for boosting the computing performance. We have tested this algorithm and presented the experimental results using a large-scale TREC8 collection and investigated both computing performance and problem size scalability issue.