Construction of optimal graphs for bit-vector compression

  • Authors:
  • A. Bookstein;S. T. Klein

  • Affiliations:
  • Center for Information and Language Studies, University of Chicago, 1100 East 57-th Street, Chicago, IL;Center for Information and Language Studies, University of Chicago, 1100 East 57-th Street, Chicago, IL

  • Venue:
  • SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 1989

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bitmaps are data structures occurring often in information retrieval. They are useful; they are also large and expensive to store. For this reason, considerable effort has been devoted to finding techniques for compressing them. These techniques are most effective for sparse bitmaps. We propose a preprocessing stage, in which bitmaps are first clustered and the clusters used to transform their member bitmaps into sparser ones, that can be more effectively compressed. The clustering method efficiently generates a graph structure on the bitmaps. The results of applying our algorithm to the Bible is presented: for some sets of bitmaps, our method almost doubled the compression savings.