Regular expression constrained sequence alignment

Authors:
Abdullah N. Arslan
Affiliations:
Department of Computer Science, The University of Vermont, Burlington, VT
Venue:
CPM'05 Proceedings of the 16th annual conference on Combinatorial Pattern Matching
Year:
2005

Citing 6
Cited 3

Introduction To Automata Theory, Languages, And Computation

Introduction To Automata Theory, Languages, And Computation
Constrained Multiple Sequence Alignment Tool Development and Its Application to RNase Family Alignment

CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Efficient Constrained Multiple Sequence Alignment with Performance Guarantee

CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
The constrained longest common subsequence problem

Information Processing Letters
A simple algorithm for the constrained sequence problems

Information Processing Letters
MuSiC: a tool for multiple sequence alignment with constraints

Bioinformatics

Efficient algorithms for regular expression constrained sequence alignment

Information Processing Letters
Constrained sequence alignment: A general model and the hardness results

Discrete Applied Mathematics
Efficient algorithms for regular expression constrained sequence alignment

CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given strings S1, S2, and a regular expression R, we introduce regular expression constrained sequence alignment as the problem of finding the maximum alignment score between S1 and S2 over all alignments such that in these alignments there exists a segment where some substring s1 of S1 is aligned with some substring s2 of S2, and both s1 and s2 match R, i.e. s1,s2 ∈ L(R) where L(R) is the regular language described by R. A motivation for the problem is that protein sequences can be aligned in a way that known motifs guide the alignments. We present an O(nmr) time algorithm for the regular expression constrained sequence alignment problem where n, and m are the lengths of S1, and S2, respectively, and r is in the order of the size of the transition function of a finite automaton M that we create from a nondeterministic finite automaton N accepting L(R). M contains O(t2) states if N has t states.