Text algorithms
Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
On the combinatorics of finite words
Theoretical Computer Science
Complexity for finite factors of infinite sequences
Theoretical Computer Science
Algorithms for the Longest Common Subsequence Problem
Journal of the ACM (JACM)
Efficient reconstruction of sequences from their subsequences or supersequences
Journal of Combinatorial Theory Series A
A Survey of Longest Common Subsequence Algorithms
SPIRE '00 Proceedings of the Seventh International Symposium on String Processing Information Retrieval (SPIRE'00)
Combinatorial Representations of Token Sequences
Journal of Classification
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching
A note on double insertion/deletion correcting codes
IEEE Transactions on Information Theory
Measuring tree similarity for natural language processing based information retrieval
NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
Information Sciences: an International Journal
Pattern Recognition Letters
Hi-index | 5.23 |
A subsequence is obtained from a string by deleting any number of characters; thus in contrast to a substring, a subsequence is not necessarily a contiguous part of the string. Counting subsequences under various constraints has become relevant to biological sequence analysis, to machine learning, to coding theory, to the analysis of categorical time series in the social sciences, and to the theory of word complexity. We present theorems that lead to efficient dynamic programming algorithms to count (1) distinct subsequences in a string, (2) distinct common subsequences of two strings, (3) matching joint embeddings in two strings, (4) distinct subsequences with a given minimum span, and (5) sequences generated by a string allowing characters to come in runs of a length that is bounded from above.