Practical Algorithms for the Longest Common Extension Problem

Authors:
Lucian Ilie;Liviu Tinta
Affiliations:
Department of Computer Science, University of Western Ontario, London, Canada N6A 5B7;Department of Computer Science, University of Western Ontario, London, Canada N6A 5B7
Venue:
SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Year:
2009

Citing 14
Cited 4

Fast algorithms for finding nearest common ancestors

SIAM Journal on Computing
An O(n log n) algorithm for finding all repetitions in a string

Journal of Algorithms
Introducing efficient parallelism into approximate string matching and a new serial algorithm

STOC '86 Proceedings of the eighteenth annual ACM symposium on Theory of computing
On finding lowest common ancestors: simplification and parallelization

SIAM Journal on Computing
Recursive star-tree parallel data structure

SIAM Journal on Computing
Suffix arrays: a new method for on-line string searches

SIAM Journal on Computing
Algorithms on strings, trees, and sequences: computer science and computational biology

Algorithms on strings, trees, and sequences: computer science and computational biology
The LCA Problem Revisited

LATIN '00 Proceedings of the 4th Latin American Symposium on Theoretical Informatics
Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications

CPM '01 Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching
Replacing suffix trees with enhanced suffix arrays

Journal of Discrete Algorithms - SPIRE 2002
Linear time algorithms for finding and representing all the tandem repeats in a string

Journal of Computer and System Sciences
Simple linear work suffix array construction

ICALP'03 Proceedings of the 30th international conference on Automata, languages and programming
Theoretical and practical improvements on the RMQ-Problem, with applications to LCA and LCE

CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching
A modification of the landau-vishkin algorithm computing longest common extensions via suffix arrays

BSB'05 Proceedings of the 2005 Brazilian conference on Advances in Bioinformatics and Computational Biology

The longest common extension problem revisited and applications to approximate string searching

Journal of Discrete Algorithms
A fast longest common subsequence algorithm for similar strings

LATA'10 Proceedings of the 4th international conference on Language and Automata Theory and Applications
New simple efficient algorithms computing powers and runs in strings

Discrete Applied Mathematics
Extracting powers and periods in a word from its runs structure

Theoretical Computer Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Longest Common Extension problem considers a string s and computes, for each of a number of pairs (i ,j ), the longest substring of s that starts at both i and j . It appears as a subproblem in many fundamental string problems and can be solved by linear-time preprocessing of the string that allows (worst-case) constant-time computation for each pair. The two known approaches use powerful algorithms: either constant-time computation of the Lowest Common Ancestor in trees or constant-time computation of Range Minimum Queries (RMQ) in arrays. We show here that, from practical point of view, such complicated approaches are not needed. We give two very simple algorithms for this problem that require no preprocessing. The first needs only the string and is significantly faster than all previous algorithms on the average. The second combines the first with a direct RMQ computation on the Longest Common Prefix array. It takes advantage of the superior speed of the cache memory and is the fastest on virtually all inputs.