Improved range-summable random variable construction algorithms

  • Authors:
  • A. R. Calderbank;A. Gilbert;K. Levchenko;S. Muthukrishnan;M. Strauss

  • Affiliations:
  • Princeton University, Princeton, New Jersey;University of Michigan, Ann Arbor, Michigan;University of California San Diego, La Jolla, California;Rutgers University, Piscataway, New Jersey;University of Michigan, Ann Arbor, Michigan

  • Venue:
  • SODA '05 Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Range-summable universal hash functions, also known as range-summable random variables, are binary-valued hash functions which can efficiently hash single values as well as ranges of values from the domain. They have found several applications in the area of data stream processing where they are used to construct sketches---small-space summaries of the input sequence.We present two new constructions of range-summable universal hash functions on n-bit strings, one based on Reed-Muller codes which gives k-universal hashing using O(nlog k) space and time for point operations and O(n2 log k) for range operations, and another based on a new subcode of the second-order Reed-Muller code, which gives 5-universal hashing using O(n) space, O(n log3 n) time for point operations, and O(n3) time for range operations.We also present a new sketch data structure using the new hash functions which improves several previous results.