DNA fragment assembly using a grid-based genetic algorithm

  • Authors:
  • A. J. Nebro;G. Luque;F. Luna;E. Alba

  • Affiliations:
  • Departamento de Lenguajes y Ciencias de la Computación, E.T.S.I. Infomática, University of Málaga, Boulevard Louis Pasteur 29071, Spain;Departamento de Lenguajes y Ciencias de la Computación, E.T.S.I. Infomática, University of Málaga, Boulevard Louis Pasteur 29071, Spain;Departamento de Lenguajes y Ciencias de la Computación, E.T.S.I. Infomática, University of Málaga, Boulevard Louis Pasteur 29071, Spain;Departamento de Lenguajes y Ciencias de la Computación, E.T.S.I. Infomática, University of Málaga, Boulevard Louis Pasteur 29071, Spain

  • Venue:
  • Computers and Operations Research
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we propose a genetic algorithm (GA) for solving the DNA fragment assembly problem in a computational grid. The algorithm, which is named GrEA, is a steady-state GA which uses a panmitic population, and it is based on computing parallel function evaluations in an asynchronous way. We have implemented GrEA on top of the Condor system, and we have used it to solve the DNA assembly problem. This is an NP-hard combinatorial optimization problem which is growing in importance and complexity as more research centers become involved on sequencing new genomes. While previous works on this problem have usually faced 30K base pairs (bps) long instances, we have tackled here a 77K bps long one to show how a grid system can move research forward. After analyzing the basic grid algorithm, we have studied the use of an improvement method to still enhance its scalability. Then, by using a grid composed of up to 150 computers, we have achieved time reductions from tens of days down to a few hours, and we have obtained near optimal solutions when solving the 77K bps long instance (773 fragments). We conclude that our proposal is a promising approach to take advantage of a grid system to solve large DNA fragment assembly problem instances and also to learn more about grid metaheuristics as a new class of algorithms for really challenging problems.