A parallel algorithm for multiple biological sequence alignment

  • Authors:
  • Irma R. Andalon-Garcia;Arturo Chavoya;M. E. Meda-Campaña

  • Affiliations:
  • Department of Information Systems, Universidad de Guadalajara, Zapopan, Jal., Mexico;Department of Information Systems, Universidad de Guadalajara, Zapopan, Jal., Mexico;Department of Information Systems, Universidad de Guadalajara, Zapopan, Jal., Mexico

  • Venue:
  • IPCAT'12 Proceedings of the 9th international conference on Information Processing in Cells and Tissues
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The search of a multiple sequence alignment (MSA) is a well-known problem in bioinformatics that consists in finding a sequence alignment of three or more biological sequences. In this paper, we propose a parallel iterative algorithm for the global alignment of multiple biological sequences. In this algorithm, a number of processes work independently at the same time searching for the best MSA of a set of sequences. It uses a Longest Common Subsequence (LCS) technique in order to generate a first MSA. An iterative process improves the MSA by applying a number of operators that have been implemented to produce more accurate alignments. Simulations were made using sequences from the UniProKB protein database. A preliminary performance analysis and comparison with several common methods for MSA shows promising results. The implementation was developed on a cluster platform through the use of the standard Message Passing Interface (MPI) library.