A combined semi-pipelined query processing architecture for distributed full-text retrieval

  • Authors:
  • Simon Jonassen;Svein Erik Bratsberg

  • Affiliations:
  • Department of Computer and Information Science, Norwegian University of Science and Technology, Trondheim, Norway;Department of Computer and Information Science, Norwegian University of Science and Technology, Trondheim, Norway

  • Venue:
  • WISE'10 Proceedings of the 11th international conference on Web information systems engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Term-partitioning is an efficient way to distribute a large inverted index. Two fundamentally different query processing approaches are pipelined and non-pipelined. While the pipelined approach provides higher query throughput, the non-pipelined approach provides shorter query latency. In this work we propose a third alternative, combining non-pipelined inverted index access, heuristic decision between pipelined and non-pipelined query execution and an improved query routing strategy. From our results, the method combines the advantages of both approaches and provides high throughput and short query latency. Our method increases the throughput by up to 26% compared to the nonpipelined approach and reduces the latency by up to 32% compared to the pipelined.