String convolution between integer vectors representing a pattern and a text is a widely used computational primitive in string processing. In this paper, we investigate an algorithmic framework that exploits sequence repetitions, identified by Lempel-Ziv parsing (specifically, the LZ78 algorithm), to speed up conventional algorithms based on the Fast Fourier Transform (FFT) for computing the convolution between a pattern and a text, when the text is long enough and the pattern is sufficiently small. In particular, we present a deterministic algorithm which, given a text $T$ of length $n$ (drawn from a constant-size alphabet $\Sigma_T$) and a pattern $P$ of length $m$ (drawn from a constant-size alphabet $\Sigma_P$), computes the convolution between $P$ and $T$ with time and space complexity $O\left(n + \frac{nm}{\log n}\,h\right)$, where $h$ is the entropy of the text $T$.
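As a rough illustration of the two ingredients named in the abstract, the sketch below shows a plain LZ78 parse of a text and a direct (quadratic) pattern-text convolution. It is not the algorithm of the paper: the function names `lz78_parse` and `convolve_direct` are hypothetical, and the convolution is assumed here to take the correlation form $C[i] = \sum_{j=0}^{m-1} P[j]\,T[i+j]$, which the abstract does not spell out.

```python
def lz78_parse(text):
    """Split text into LZ78 phrases.

    Each phrase is the longest previously seen phrase extended by one
    symbol. Repeated material therefore ends up in few, long phrases,
    which is the repetition structure the abstract refers to.
    """
    trie = {}      # (parent phrase id, symbol) -> phrase id; id 0 is the empty phrase
    phrases = []
    node = 0       # id of the phrase matched so far
    start = 0      # start index of the phrase currently being built
    for i, ch in enumerate(text):
        key = (node, ch)
        if key in trie:
            node = trie[key]              # keep extending the current match
        else:
            trie[key] = len(phrases) + 1  # register a new phrase
            phrases.append(text[start:i + 1])
            node = 0
            start = i + 1
    if start < len(text):
        phrases.append(text[start:])      # trailing phrase that repeats an earlier one
    return phrases


def convolve_direct(pattern, text):
    """Direct O(n*m) convolution (assumed correlation form) of integer vectors."""
    m, n = len(pattern), len(text)
    return [
        sum(pattern[j] * text[i + j] for j in range(m))
        for i in range(n - m + 1)
    ]


if __name__ == "__main__":
    print(lz78_parse("abababab"))
    print(convolve_direct([1, 2], [1, 0, 2, 3]))
```

The direct loop is only a baseline. The FFT-based algorithms mentioned in the abstract compute the same values faster, and the framework described there additionally reuses work across repeated LZ78 phrases.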