On (ε,k)-min-wise independent permutations

  • Authors:
  • Noga Alon;Toshiya Itoh;Tatsuya Nagatani

  • Affiliations:
  • School of Mathematics and Computer Science, Raymond and Beverly Sackler Faculty of Exact Science, Tel Aviv University, Tel Aviv 69978, Israel;Global Scientific Information and Computing Center, Tokyo Institute of Technology, 2–12–1 O-okayama, Meguro-ku, Tokyo 152–8550, Japan;Advanced Technology R&D Center, Mitsubishi Electric Corporation, 8––1–1, Tsukaguchi-Honmachi, Amagasaki, Hyogo 661–8661, Japan

  • Venue:
  • Random Structures & Algorithms
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

A family of permutations of $\cal{F}$ [n] = {1,2,…,n} is (ε,k)-min-wise independent if for every nonempty subset X of at most k elements of [n], and for any x ∈ X, the probability that in a random element π of $\cal{F}$, π(x) is the minimum element of π(X), deviates from 1/∣X∣ by at most ε/∣X∣. This notion can be defined for the uniform case, when the elements of $\cal{F}$ are picked according to a uniform distribution, or for the more general, biased case, in which the elements of $\cal{F}$ are chosen according to a given distribution D. It is known that this notion is a useful tool for indexing replicated documents on the web. We show that even in the more general, biased case, for all admissible k and ε and all large n, the size of $\cal{F}$ must satisfy $$|{\cal{F}}| \ge \Omega \left({k \over \varepsilon^2\log(1/\varepsilon)} \log n\right),$$ as well as $$|{\cal{F}}| \ge \Omega \left({k^2 \over \varepsilon\log(1/\varepsilon)} \log n\right).$$ This improves the best known previous estimates even for the uniform case. © 2007 Wiley Periodicals, Inc. Random Struct. Alg., 2007