Average complexity of exact and approximate multiple string matching

  • Authors:
  • Gonzalo Navarro;Kimmo Fredriksson

  • Affiliations:
  • Department of Computer Science, University of Chile, Santiago, Chile;Department of Computer Science, University of Helsinki, Finland and Department of Computer Science, University of Joensuu, Finland

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2004

Quantified Score

Hi-index 5.23

Abstract

We show that the average number of characters examined to search for r random patterns of length m in a text of length n over a uniformly distributed alphabet of size σ cannot be less than Ω(n log_σ(rm)/m). When we permit up to k insertions, deletions, and/or substitutions of characters in the occurrences of the patterns, the lower bound becomes Ω(n(k + log_σ(rm))/m). This generalizes previous single-pattern lower bounds of Yao (for exact matching) and of Chang and Marr (for approximate matching), and proves the optimality of several existing multipattern search algorithms.
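
To get a feel for how the two bounds scale, the following minimal Python sketch evaluates the expressions inside the Ω(...) terms. The function names `exact_bound` and `approx_bound` are hypothetical and are not taken from the paper. Asymptotic notation hides constant factors, so the returned values only indicate growth and should not be read as actual character counts.

```python
import math


def exact_bound(n: int, m: int, r: int, sigma: int) -> float:
    """Expression inside the exact-matching bound: n * log_sigma(r*m) / m.

    Constant factors hidden by the Omega notation are ignored (assumption).
    """
    return n * math.log(r * m, sigma) / m


def approx_bound(n: int, m: int, r: int, sigma: int, k: int) -> float:
    """Expression inside the approximate-matching bound: n * (k + log_sigma(r*m)) / m.

    k is the number of allowed insertions, deletions, and/or substitutions.
    """
    return n * (k + math.log(r * m, sigma)) / m


if __name__ == "__main__":
    # Illustrative parameters only: DNA-sized alphabet, 16 patterns of length 16.
    n, m, r, sigma = 1_000_000, 16, 16, 4
    print(exact_bound(n, m, r, sigma))        # multipattern, exact
    print(exact_bound(n, m, 1, sigma))        # r = 1: single-pattern form
    print(approx_bound(n, m, r, sigma, k=2))  # multipattern, up to 2 errors
```

With r = 1 the first expression reduces to n log_σ(m)/m, the single-pattern form that the abstract attributes to Yao, and with k = 0 the second expression coincides with the first. For the illustrative parameters above, log_4(256) = 4, so the exact-matching expression comes to about 250,000.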