Lower bounds for orthogonal range searching: I. The reporting case
Journal of the ACM (JACM)
Suffix arrays: a new method for on-line string searches
SIAM Journal on Computing
A Space-Economical Suffix Tree Construction Algorithm
Journal of the ACM (JACM)
Journal of the ACM (JACM)
Compressed Suffix Arrays and Suffix Trees with Applications to Text Indexing and String Matching
SIAM Journal on Computing
ACM Computing Surveys (CSUR)
Geometric Burrows-Wheeler Transform: Linking Range Searching and Text Indexing
DCC '08 Proceedings of the Data Compression Conference
Linear pattern matching algorithms
SWAT '73 Proceedings of the 14th Annual Symposium on Switching and Automata Theory
Orthogonal range searching on the RAM, revisited
Proceedings of the twenty-seventh annual symposium on Computational geometry
CPM'11 Proceedings of the 22nd annual conference on Combinatorial pattern matching
Alphabet-independent compressed text indexing
ESA'11 Proceedings of the 19th European conference on Algorithms
Persistency in suffix trees with applications to string interval problems
SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Compressed text indexing with wildcards
SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Position-Restricted substring searching
LATIN'06 Proceedings of the 7th Latin American conference on Theoretical Informatics
We study the position-restricted substring searching (PRSS) problem, where the task is to index a text $T[0 \ldots n-1]$ of $n$ characters over an alphabet set $\Sigma$ of size $\sigma$, in order to answer the following: given a query pattern $P$ (of length $p$) and two indices $\ell$ and $r$, report all $occ_{\ell,r}$ occurrences of $P$ in $T[\ell \ldots r]$. Known indexes take $O(n \log n)$ bits or $O(n \log^{1+\epsilon} n)$ bits of space, and answer this query in $O(p + \log n + occ_{\ell,r} \log n)$ time or in optimal $O(p + occ_{\ell,r})$ time respectively, where $\epsilon$ is any positive constant. The main drawback of these indexes is their space requirement of $\Omega(n \log n)$ bits, which can be much more than the optimal $n \log \sigma$ bits to store the text $T$. This paper addresses an open question asked by Mäkinen and Navarro [LATIN, 2006], which is whether it is possible to design a succinct index answering PRSS queries efficiently. We first study the hardness of this problem and prove the following result: a succinct (or a compact) index cannot answer PRSS queries efficiently in the pointer machine model, and also not in the RAM model unless bounds on the well-researched orthogonal range query problem improve. However, for the special case of sufficiently long query patterns, that is for $p = \Omega(\log^{2+\epsilon} n)$, we derive an $|CSA_f| + |CSA_r| + o(n)$-bit index with optimal query time, where $|CSA_f|$ and $|CSA_r|$ are the space (in bits) of the compressed suffix arrays (with $O(p)$ time for pattern search) of $T$ and $T$