Inverted Index Compression Using Word-Aligned Binary Codes

  • Authors: Vo Ngoc Anh; Alistair Moffat

  • Affiliations: Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia (both authors); contact: alistair@cs.mu.oz.au

  • Venue: Information Retrieval
  • Year: 2005

Abstract

We examine index representation techniques for document-based inverted files, and present a mechanism for compressing them using word-aligned binary codes. The new approach allows extremely fast decoding of inverted lists during query processing, while providing compression rates better than other high-throughput representations. Results are given for several large text collections in support of these claims, both for compression effectiveness and query efficiency.
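To make the idea of a word-aligned binary code concrete, the following is a minimal sketch under stated assumptions, not the authors' implementation. It assumes a 32-bit word split into a 4-bit selector and 28 data bits, with nine possible layouts of equal-width fields, and it assumes the values being coded are non-negative integers below 2^28 (for example, gaps between successive document numbers in an inverted list). The names `LAYOUTS`, `encode`, and `decode` are hypothetical and chosen only for illustration; the abstract itself does not specify these details.

```python
# Assumed layout table: (values per word, bits per value).
# Every entry satisfies count * bits <= 28, leaving 4 bits for the selector.
LAYOUTS = [(28, 1), (14, 2), (9, 3), (7, 4), (5, 5),
           (4, 7), (3, 9), (2, 14), (1, 28)]


def encode(values):
    """Greedily pack non-negative integers (< 2**28) into 32-bit words."""
    words = []
    i = 0
    while i < len(values):
        # Try the densest layout first; take the first one that is
        # completely filled by the next `count` values.
        for selector, (count, bits) in enumerate(LAYOUTS):
            chunk = values[i:i + count]
            if len(chunk) == count and all(0 <= v < (1 << bits) for v in chunk):
                word = selector << 28              # selector in the top 4 bits
                for j, v in enumerate(chunk):
                    word |= v << (j * bits)        # fixed-width fields below it
                words.append(word)
                i += count
                break
        else:
            raise ValueError("value is negative or does not fit in 28 bits")
    return words


def decode(words):
    """Unpack words produced by encode() back into the original integers."""
    out = []
    for word in words:
        count, bits = LAYOUTS[word >> 28]          # one table lookup per word
        mask = (1 << bits) - 1
        for j in range(count):
            out.append((word >> (j * bits)) & mask)
    return out


if __name__ == "__main__":
    gaps = [3, 4, 1, 7, 1, 284]                    # illustrative document gaps
    packed = encode(gaps)
    print(len(packed), decode(packed) == gaps)
```

In this sketch the decoder performs one selector lookup per word and then extracts fields with shifts and masks only, which is the property that plausibly makes word-aligned codes fast to decode compared with bit-aligned codes. The encoder only emits completely filled words, so no padding or explicit length is needed, at the cost of a less compact tail.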