Multiple sequence alignment based on profile alignment of intermediate sequences
RECOMB'07 Proceedings of the 11th annual international conference on Research in computational molecular biology
A novel approach to multiple sequence alignment using hadoop data grids
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
A novel approach to Multiple Sequence Alignment using hadoop data grids
International Journal of Bioinformatics Research and Applications
The Impact of Multiple Protein Sequence Alignment on Phylogenetic Estimation
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Proceedings of the ATIP/A*CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?
A data parallel strategy for aligning multiple biological sequences on multi-core computers
Computers in Biology and Medicine
Scalability and accuracy improvements of consistency-based multiple sequence alignment tools
Proceedings of the 20th European MPI Users' Group Meeting
GLProbs: Aligning multiple sequences adaptively
Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
ChainKnot: a comparative H-type pseudoknot prediction tool using multiple ab initio folding tools
Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Hi-index | 3.84 |
Motivation: The maximum expected accuracy optimization criterion for multiple sequence alignment uses pairwise posterior probabilities of residues to align sequences. The partition function methodology is one way of estimating these probabilities. Here, we combine these two ideas for the first time to construct maximal expected accuracy sequence alignments. Results: We bridge the two techniques within the program Probalign. Our results indicate that Probalign alignments are generally more accurate than other leading multiple sequence alignment methods (i.e. Probcons, MAFFT and MUSCLE) on the BAliBASE 3.0 protein alignment benchmark. Similarly, Probalign also outperforms these methods on the HOMSTRAD and OXBENCH benchmarks. Probalign ranks statistically highest (P-value 300 and 400, respectively. Availability: Open source code implementing Probalign as well as for producing the simulated data, and all real and simulated data are freely available from http://www.cs.njit.edu/usman/probalign Contact: usman@cs.njit.edu