Uniform Hashing in Constant Time and Optimal Space

Authors:
Anna Pagh;Rasmus Pagh
Affiliations:
annao@itu.dk and pagh@itu.dk;-
Venue:
SIAM Journal on Computing
Year:
2008

Citing 0
Cited 9

Efficient access enforcement in distributed role-based access control (RBAC) deployments

Proceedings of the 14th ACM symposium on Access control models and technologies
Applications of a Splitting Trick

ICALP '09 Proceedings of the 36th International Colloquium on Automata, Languages and Programming: Part I
An optimal algorithm for the distinct elements problem

Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Fast Manhattan sketches in data streams

Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Parallel evaluation of conjunctive queries

Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Fast moment estimation in data streams in optimal space

Proceedings of the forty-third annual ACM symposium on Theory of computing
Explicit and efficient hash families suffice for cuckoo hashing with a stash

ESA'12 Proceedings of the 20th Annual European conference on Algorithms
Hardness preserving reductions via cuckoo hashing

TCC'13 Proceedings of the 10th theory of cryptography conference on Theory of Cryptography
Efficient sampling of non-strict turnstile data streams

FCT'13 Proceedings of the 19th international conference on Fundamentals of Computation Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many algorithms and data structures employing hashing have been analyzed under the uniform hashing assumption, i.e., the assumption that hash functions behave like truly random functions. Starting with the discovery of universal hash functions, many researchers have studied to what extent this theoretical ideal can be realized by hash functions that do not take up too much space and can be evaluated quickly. In this paper we present an almost ideal solution to this problem: a hash function $h: U\rightarrow V$ that, on any set of $n$ inputs, behaves like a truly random function with high probability, can be evaluated in constant time on a RAM and can be stored in $(1+\epsilon)n\log |V| + O(n+\log\log |U|)$ bits. Here $\epsilon$ can be chosen to be any positive constant, so this essentially matches the entropy lower bound. For many hashing schemes this is the first hash function that makes their uniform hashing analysis come true, with high probability, without incurring overhead in time or space.