Cross-Document Pattern Matching

  • Authors:
  • Gregory Kucherov; Yakov Nekrich; Tatiana Starikovskaya

  • Affiliations:
  • Laboratoire d'Informatique Gaspard Monge, Université Paris-Est & CNRS, Paris, France; Department of Computer Science, University of Chile, Santiago, Chile; Lomonosov Moscow State University, Moscow, Russia, Laboratoire d'Informatique Gaspard Monge, Université Paris-Est & CNRS, Paris, France

  • Venue:
  • CPM'12 Proceedings of the 23rd Annual Conference on Combinatorial Pattern Matching
  • Year:
  • 2012

Abstract

We study a new variant of the string matching problem called cross-document string matching, which is the problem of indexing a collection of documents to support an efficient search for a pattern in a selected document, where the pattern itself is a substring of another document. Several variants of this problem are considered, and efficient linear-space solutions are proposed with query time bounds that either do not depend at all on the pattern size or depend on it in a very limited way (doubly logarithmic). As a side result, we propose an improved solution to the weighted level ancestor problem.
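
To make the query semantics concrete, the following is a naive baseline sketch of a cross-document reporting query. It is not the indexing structure proposed in the paper: it builds no index and simply scans the selected document, so its running time depends on both the pattern and the text length. The function name `cross_document_occurrences`, the parameter names, and the 0-based half-open index convention are assumptions made for illustration only.

```python
from typing import List, Sequence


def cross_document_occurrences(
    docs: Sequence[str], k: int, i: int, j: int, l: int
) -> List[int]:
    """Naive baseline (hypothetical helper, not the paper's method).

    The pattern is the substring docs[k][i:j] (0-based, half-open: an
    assumed convention). Returns every start position of that pattern in
    docs[l], including overlapping occurrences.
    """
    if not (0 <= i < j <= len(docs[k])):
        raise ValueError("pattern must be a non-empty substring of docs[k]")

    pattern = docs[k][i:j]
    text = docs[l]

    positions = []
    start = text.find(pattern)
    while start != -1:
        positions.append(start)
        # Advance by one character so overlapping matches are also found.
        start = text.find(pattern, start + 1)
    return positions


docs = ["abracadabra", "cadabra cadabra", "xyz"]
# Pattern is docs[0][4:7] == "cad"; search for it in docs[1].
print(cross_document_occurrences(docs, 0, 4, 7, 1))  # [0, 8]
```

A counting query under the same assumptions is just the length of the returned list. The point of the paper's linear-space solutions is to answer such queries without the scan over the selected document that this baseline performs.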