Cache-, hash- and space-efficient bloom filters

  • Authors:
  • Felix Putze;Peter Sanders;Johannes Singler

  • Affiliations:
  • Fakultät für Informatik, Universität Karlsruhe, Germany;Fakultät für Informatik, Universität Karlsruhe, Germany;Fakultät für Informatik, Universität Karlsruhe, Germany

  • Venue:
  • WEA'07 Proceedings of the 6th international conference on Experimental algorithms
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

A Bloom filter is a very compact data structure that supports approximate membership queries on a set, allowing false positives. We propose several new variants of Bloom filters and replacements with similar functionality. All of them have a better cache-efficiency and need less hash bits than regular Bloom filters. Some use SIMD functionality, while the others provide an even better space efficiency. As a consequence, we get a more flexible trade-off between false positive rate, space-efficiency, cache-efficiency, hash-efficiency, and computational effort. We analyze the efficiency of Bloom filters and the proposed replacements in detail, in terms of the false positive rate, the number of expected cache-misses, and the number of required hash bits. We also describe and experimentally evaluate the performance of highly-tuned implementations. For many settings, our alternatives perform better than the methods proposed so far.