Faster index for property matching

  • Authors:
  • Costas S. Iliopoulos;M. Sohel Rahman

  • Affiliations:
  • Algorithm Design Group, Department of Computer Science, King's College London, Strand, London WC2R 2LS, England, UK;Algorithm Design Group, Department of Computer Science, King's College London, Strand, London WC2R 2LS, England, UK

  • Venue:
  • Information Processing Letters
  • Year:
  • 2008

Quantified Score

Hi-index 0.89

Visualization

Abstract

In this paper, we revisit the Property Matching problem studied by Amir et al. [Property Matching and Weighted Matching, CPM 2006] and present a better indexing scheme for the problem. In particular, the data structure by Amir et al., namely PST, requires O(nlog|@S|+nloglogn) construction time and O(mlog|@S|+K) query time, where n and m are the length of, respectively, the text and the pattern, @S is the alphabet and K is the output size. On the other hand, the construction time of our data structure, namely IDS_PIP, is dominated by the suffix tree construction time and hence is O(n) time for alphabets that are natural numbers from 1 to a polynomial in n and O(nlog@s) time otherwise, where @s=min(n,|@S|). The query time is same as that of PST. Also, IDS_PIP has the advantage that it can be built on either a suffix tree or a suffix array and additionally, it retains the capability of answering normal pattern matching queries.