A Fast Static Index Pruning Algorithm

  • Authors:
  • Xiaofeng Liu

  • Affiliations:
  • School of Software Engineering, Huazhong University of Science and Technology Wuhan, China

  • Venue:
  • Proceedings of the Second International Conference on Innovative Computing and Cloud Computing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

As a query processing optimization technique over inverted index, static index pruning can significantly reduce index size and query processing time. A fast static index pruning algorithm is presented, which is a term-centric method and adopts BM25 weighting as the pruning measure. The algorithm scans through documents set with one pass and directly builds pruned index, and therefore avoids the construction of original index. The correctness of the algorithm is proved and the theoretical analysis reveals that its IO performance takes precedence over other algorithms. The experiments based on TREC data set also show that the fast static index pruning algorithm requires less time to build pruned index, and the pruning effectiveness outperforms the baseline method.