Less hashing, same performance: Building a better Bloom filter

  • Authors:
  • Adam Kirsch;Michael Mitzenmacher

  • Affiliations:
  • Harvard School of Engineering and Applied Sciences, Cambridge, Massachusetts 02138;Harvard School of Engineering and Applied Sciences, Cambridge, Massachusetts 02138

  • Venue:
  • Random Structures & Algorithms
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

A standard technique from the hashing literature is to use two hash functions h1(x) and h2(x) to simulate additional hash functions of the form gi(x) = h1(x) + ih2(x). We demonstrate that this technique can be usefully applied to Bloom filters and related data structures. Specifically, only two hash functions are necessary to effectively implement a Bloom filter without any loss in the asymptotic false positive probability. This leads to less computation and potentially less need for randomness in practice. © 2008 Wiley Periodicals, Inc. Random Struct. Alg., 2008 Preliminary versions of this work appeared in [17] and [16].