The Power of Simple Tabulation Hashing

Authors:
Mihai Pǎtraşcu;Mikkel Thorup
Affiliations:
AT&T Labs---Research;AT&T Labs---Research
Venue:
Journal of the ACM (JACM)
Year:
2012

Citing 20
Cited 3

Randomized algorithms and pseudorandom numbers

Journal of the ACM (JACM)
Randomized algorithms

Randomized algorithms
Chernoff-Hoeffding Bounds for Applications with Limited Independence

SIAM Journal on Discrete Mathematics
A reliable randomized algorithm for the closest-pair problem

Journal of Algorithms
The space complexity of approximating the frequency moments

Journal of Computer and System Sciences
Balanced Allocations

SIAM Journal on Computing
Even strongly universal hashing is pretty fast

SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
Min-wise independent permutations

Journal of Computer and System Sciences - 30th annual ACM symposium on theory of computing
A small approximately min-wise independent family of hash functions

Journal of Algorithms
Universal Hashing and k-Wise Independent Random Variables via Integer Arithmetic without Primes

STACS '96 Proceedings of the 13th Annual Symposium on Theoretical Aspects of Computer Science
Almost random graphs with simple hash functions

Proceedings of the thirty-fifth annual ACM symposium on Theory of computing
On Universal Classes of Extremely Random Constant-Time Hash Functions

SIAM Journal on Computing
Cuckoo hashing

Journal of Algorithms
Why simple hash functions work: exploiting the entropy in a data stream

Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
String hashing for linear probing

SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
On risks of using cuckoo hashing with simple universal hash classes

SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
Applications of a Splitting Trick

ICALP '09 Proceedings of the 36th International Colloquium on Automata, Languages and Programming: Part I
Linear Probing with Constant Independence

SIAM Journal on Computing
On the k-independence required by linear probing and minwise independence

ICALP'10 Proceedings of the 37th international colloquium conference on Automata, languages and programming
Tabulation-Based 5-Independent Hashing with Applications to Linear Probing and Second Moment Estimation

SIAM Journal on Computing

Mihai Pǎtraşcu: obituary and open problems

ACM SIGACT News
Software defined traffic measurement with OpenSketch

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
STRIP: stream learning of influence probabilities

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Randomized algorithms are often enjoyed for their simplicity, but the hash functions used to yield the desired theoretical guarantees are often neither simple nor practical. Here we show that the simplest possible tabulation hashing provides unexpectedly strong guarantees. The scheme itself dates back to Zobrist in 1970 who used it for game playing programs. Keys are viewed as consisting of c characters. We initialize c tables H1, ..., Hc mapping characters to random hash codes. A key x = (x1, ..., xc) is hashed to H1[x1] ⊕ ⋯ ⊕ Hc[xc], where ⊕ denotes bit-wise exclusive-or. While this scheme is not even 4-independent, we show that it provides many of the guarantees that are normally obtained via higher independence, for example, Chernoff-type concentration, min-wise hashing for estimating set intersection, and cuckoo hashing.