Adaptive Bitmap Indexes for Space-Constrained Systems

  • Authors:
  • Rishi Rakesh Sinha;Marianne Winslett;Kesheng Wu;Kurt Stockinger;Arie Shoshani

  • Affiliations:
  • Microsoft Corporation, USA. rsinha@microsoft.com;Department of Computer Science, University of Illinois at Urbana-Champaign, USA. winslett@cs.uiuc.edu;Lawrence Berkeley National Lab, USA. kwu@lbl.gov;Lawrence Berkeley National Lab, USA. stockinger@lbl.gov;Lawrence Berkeley National Lab, USA. shoshani@lbl.gov

  • Venue:
  • ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
  • Year:
  • 2008

Abstract

Data management systems for "big science" often have tight memory and disk space constraints. In this paper, we introduce adaptive bitmap indexes, which conform to both memory and disk space limits while dynamically adapting to the query load and offering excellent performance. To let adaptive bitmap indexes use optimal bin boundaries, we show how to improve the scalability of optimal binning algorithms so that they can handle real-world workloads. Because the removal of false positives is the largest component of lookup time for a small-footprint bitmap index, we propose a novel way to materialize and drop auxiliary projection indexes, which eliminates the need to visit the data store to check for false positives. Our experiments with real-world data and queries show that adaptive bitmap indexes offer approximately 100-300% performance improvement (compared to standard binned bitmap indexes) at a cost of 5 MB of dedicated memory, under disk storage constraints that would cripple other indexes.
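
To make the false-positive cost mentioned in the abstract concrete, the following is a minimal sketch of a standard binned bitmap index, not the adaptive scheme the paper proposes. It assumes half-open bins, uses Python integers as uncompressed bitsets, and treats the `values` list as a stand-in for the data store; the function names `build_binned_index` and `range_query` are hypothetical. Rows in bins that lie entirely inside the query range are accepted from the bitmaps alone, while rows in bins that straddle a query endpoint are only candidates and must be checked against the raw values.

```python
from bisect import bisect_right


def build_binned_index(values, boundaries):
    """Build one bitmap per bin; bin i covers [boundaries[i], boundaries[i+1]).

    Assumes every value satisfies boundaries[0] <= v < boundaries[-1].
    Each bitmap is a Python int whose bit r is set when row r falls in the bin.
    """
    bitmaps = [0] * (len(boundaries) - 1)
    for row, v in enumerate(values):
        b = bisect_right(boundaries, v) - 1
        bitmaps[b] |= 1 << row
    return bitmaps


def range_query(values, boundaries, bitmaps, lo, hi):
    """Return (rows with lo <= value < hi, number of candidate checks).

    A candidate check is a lookup of the raw value, which in a real system
    would mean visiting the data store (or an auxiliary structure).
    """
    hits, checks = [], 0
    for b, bitmap in enumerate(bitmaps):
        b_lo, b_hi = boundaries[b], boundaries[b + 1]
        if b_hi <= lo or b_lo >= hi:
            continue  # bin is disjoint from the query range
        fully_inside = lo <= b_lo and b_hi <= hi
        row = 0
        while bitmap:
            if bitmap & 1:
                if fully_inside:
                    hits.append(row)  # answered from the bitmap alone
                else:
                    checks += 1  # edge bin: must inspect the raw value
                    if lo <= values[row] < hi:
                        hits.append(row)
            bitmap >>= 1
            row += 1
    return sorted(hits), checks


values = [3, 17, 8, 25, 12, 30, 5, 21]
boundaries = [0, 10, 20, 40]
bitmaps = build_binned_index(values, boundaries)

# Query aligned with bin boundaries: no raw values are inspected.
aligned = range_query(values, boundaries, bitmaps, 10, 20)

# Query that cuts through two bins: every row in those bins is checked.
misaligned = range_query(values, boundaries, bitmaps, 5, 22)
```

In this toy setup the misaligned query inspects all six rows of its two edge bins even though only three of them match, which is the kind of overhead that well-chosen bin boundaries and auxiliary indexes aim to reduce.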