A data parallel strategy for aligning multiple biological sequences on multi-core computers

  • Authors:
  • Xiangyuan Zhu;Kenli Li;Ahmad Salah

  • Affiliations:
  • College of Information Science and Engineering, Hunan University, Changsha, Hunan 410082, China and Education Technology and Computer Center, Zhaoqing University, Zhaoqing, Guangdong 516061, China;College of Information Science and Engineering, Hunan University, Changsha, Hunan 410082, China and National Super-computer Center in Changsha, Hunan University, Changsha, Hunan 410082, China;College of Information Science and Engineering, Hunan University, Changsha, Hunan 410082, China and Department of Computer Science, Zagazig University, Zagazig, Sharkia 44519, Egypt

  • Venue:
  • Computers in Biology and Medicine
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we address the large-scale biological sequence alignment problem, which has an increasing demand in computational biology. We employ data parallelism paradigm that is suitable for handling large-scale processing on multi-core computers to achieve a high degree of parallelism. Using the data parallelism paradigm, we propose a general strategy which can be used to speed up any multiple sequence alignment method. We applied five different clustering algorithms in our strategy and implemented rigorous tests on an 8-core computer using four traditional benchmarks and artificially generated sequences. The results show that our multi-core-based implementations can achieve up to 151-fold improvements in execution time while losing 2.19% accuracy on average. The source code of the proposed strategy, together with the test sets used in our analysis, is available on request.