Multiple sequence alignment in parallel on a workstation cluster

  • Authors:
  • Justin Ebedes;Amitava Datta

  • Affiliations:
  • School of Computer Science and Software Engineering, University of Western Australia, Perth, WA 6009, Australia;School of Computer Science and Software Engineering, University of Western Australia, Perth, WA 6009, Australia

  • Venue:
  • Bioinformatics
  • Year:
  • 2004

Quantified Score

Hi-index 3.84

Abstract

Summary: Multiple sequence alignment is the NP-hard problem of aligning three or more DNA or amino acid sequences so that as many characters as possible match across the set. The popular alignment program ClustalW follows the classical approach to approximating an alignment: it first computes a distance matrix, then constructs a guide tree that reflects the evolutionary relationships among the sequences. We show that parallelizing the ClustalW algorithm can yield a significant speedup. Our implementation is written in C with the Message Passing Interface (MPI) and runs on a cluster of workstations. Experimental results show that a speedup of over 5.5 on six processors is obtainable for most inputs.

Availability: The software is available upon request from the second author.
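
As an illustration of why the distance-matrix stage lends itself to parallel execution, the following is a minimal sketch in C, not the authors' implementation. It assumes a simple mismatch fraction over equal-length sequences as a stand-in for ClustalW's pairwise alignment scores, and it deals the sequence pairs round-robin to workers. The names mismatch_fraction, fill_distances_for_rank and MAX_SEQS are hypothetical, and MPI itself is left out so that the sketch compiles on its own.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_SEQS 16 /* hypothetical upper bound, chosen for this sketch only */

/* Fraction of positions at which two equal-length strings differ.
 * Assumption: this replaces the pairwise alignment score that a real
 * aligner would compute; unequal or empty inputs count as maximally distant. */
double mismatch_fraction(const char *a, const char *b)
{
    size_t len = strlen(a);
    size_t diff = 0;

    if (len == 0 || strlen(b) != len)
        return 1.0;
    for (size_t i = 0; i < len; i++)
        if (a[i] != b[i])
            diff++;
    return (double)diff / (double)len;
}

/* Fill the entries of the symmetric matrix dist that belong to worker
 * `rank` out of `nworkers`. Pairs (i, j) with i < j are numbered in
 * row-major order and dealt round-robin, so every pair is independent
 * of every other pair. Returns the number of pairs this worker computed. */
int fill_distances_for_rank(const char *seqs[], int n, int rank, int nworkers,
                            double dist[MAX_SEQS][MAX_SEQS])
{
    int pair = 0;
    int done = 0;

    assert(n >= 0 && n <= MAX_SEQS);
    assert(nworkers > 0 && rank >= 0 && rank < nworkers);

    for (int i = 0; i < n; i++) {
        for (int j = i + 1; j < n; j++, pair++) {
            if (pair % nworkers != rank)
                continue; /* this pair belongs to another worker */
            dist[i][j] = mismatch_fraction(seqs[i], seqs[j]);
            dist[j][i] = dist[i][j];
            done++;
        }
    }
    return done;
}
```

Calling fill_distances_for_rank once for each rank from 0 to nworkers - 1 on the same matrix fills every off-diagonal entry exactly once. In a message-passing setting each process would make only the call for its own rank, and the partial matrices would then have to be combined across processes before the guide tree is built.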