A Knowledge-Based Multiple-Sequence Alignment Algorithm

Authors:
Ken D. Nguyen;Yi Pan
Affiliations:
Clayton State University, Morrow;Georgia State University, Atlanta
Venue:
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Year:
2013

Abstract

Multiple-sequence alignment is a common and cost-effective way to identify the functions, structures, or relationships among species: DNA, RNA, or protein sequences are arranged and aligned so that similar regions are grouped together. Correctly identifying and aligning these biological similarities supports work ranging from unraveling the mystery of species evolution to drug design. We present a knowledge-based multiple-sequence alignment (KB-MSA) technique that uses existing knowledge databases such as SWISSPROT, GENBANK, or HOMSTRAD to provide more realistic and reliable sequence alignments. We also provide a modified version of this algorithm (CB-MSA) that uses sequence consistency information when sequence knowledge databases are not available. Our benchmark tests on the BAliBASE, PREFAB, HOMSTRAD, and SABMARK references show accuracy improvements of up to 10 percent on twilight data sets over many leading alignment tools such as ISPALIGN, PADT, CLUSTALW, MAFFT, PROBCONS, and T-COFFEE.
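
To make concrete what "arranged and aligned" means, the sketch below aligns two sequences with the standard Needleman-Wunsch global alignment, the pairwise step that many multiple-sequence aligners build on. It is not the KB-MSA or CB-MSA algorithm, which the abstract does not specify; the function name `needleman_wunsch` and the match, mismatch, and gap scores are assumptions chosen only for illustration.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global pairwise alignment (illustrative scores, not from the paper).

    Returns (score, aligned_a, aligned_b), where '-' marks a gap.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )

    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "ACT"))  # (1, 'ACGT', 'AC-T')
```

In the example call, the shared residues A, C, and T line up in the same columns and a single gap absorbs the extra G. A multiple-sequence aligner extends this idea to many sequences at once, and a knowledge- or consistency-based method would presumably replace the fixed scores used here with scores informed by additional evidence.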