Tight bounds for string reconstruction using substring queries

  • Authors:
  • Dekel Tsur

  • Affiliations:
  • University of California, San Diego

  • Venue:
  • APPROX'05/RANDOM'05 Proceedings of the 8th international workshop on Approximation, Randomization and Combinatorial Optimization Problems, and Proceedings of the 9th international conference on Randamization and Computation: algorithms and techniques
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We resolve two open problems presented in [8]. First, we consider the problem of reconstructing an unknown string T over a fixed alphabet using queries of the form “does the string S appear in T?” for some query string S. We show that Ω(ε−−1/2n2) queries are needed in order to reconstruct a 1–ε fraction of the strings of length n. This lower bound is asymptotically optimal since it is known that O(ε−−1/2n2) queries are sufficient. The second problem is reconstructing a string using queries of the form “does a string from $\mathcal{S}$ appear in T?”, where $\mathcal{S}$ is a set of strings. We show that a 1–ε fraction of the strings of length n can be reconstructed using O(n) queries, where the maximum length of a string in the queries is $2\log_{\sigma}n+\log_{\sigma}\frac{1}{\epsilon}+O(1)$. This construction is optimal up to constants.