Generalized function matching

  • Authors:
  • Amihood Amir;Igor Nor

  • Affiliations:
  • College of Computing, Georgia Tech, Atlanta, GA;Department of Computer Science, Bar-Ilan University, Israel

  • Venue:
  • ISAAC'04 Proceedings of the 15th international conference on Algorithms and Computation
  • Year:
  • 2004
  • Generalised Matching

    SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present problems in different application areas: tandem repeats (computational biology), poetry and music analysis, and author validation, that require a more sophisticated pattern matching model that hitherto considered. We introduce a new matching criterion – generalized function matching – that encapsulates the notion suggested by the above problems The generalized function matching problem has as its input a text T of length n over alphabet ΣT ∪ {φ } and a pattern P=P[0]P[1]...P[m−1] of length m over alphabet ΣP ∪ { φ } We seek all text locations i where the prefix of the substring that starts at i is equal to f(P[0])f(P[1])...f(P[m–1]) for some function f: ΣP → ΣT*. We give a polynomial time algorithm for the generalized pattern matching problem over bounded alphabets We identify in this problem an important new phenomenon in pattern matching One where there is a significant complexity difference between the bounded alphabet and infinite alphabet case We prove that the generalized pattern matching problem over infinite alphabets is ${\mathcal NP}$-hard To our knowledge, this is the first case in the literature where a pattern matching problem over a bounded alphabet can be solved in polynomial time but the infinite alphabet version is ${\mathcal NP}$-hard.