Genetic Algorithms, Operators, and DNA Fragment Assembly

  • Authors:
  • Rebecca J. Parsons;Stephanie Forrest;Christian Burks

  • Affiliations:
  • Los Alamos National Laboratory. Current Address: University of Central Florida, Department of Computer Science, Orlando, FL 32816-0362. rebecca@cs.ucf.edu;University of New Mexico, Department of Computer Science, Albuquerque, NM 87131-1386. forrest@cs.unm.edu;Los Alamos National Laboratory, Theoretical Biology and Biophysics Group, MS K710, Los Alamos, NM 87545. cb@t10.lanl.gov

  • Venue:
  • Machine Learning - Special issue on applications in molecular biology
  • Year:
  • 1995

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study different genetic algorithm operators for one permutation problem associated with the Human Genome Project—the assembly of DNA sequence fragments from a parent clone whose sequence is unknown into a consensus sequence corresponding to the parent sequence. The sorted-order representation, which does not require specialized operators, is compared with a more traditional permutation representation, which does require specialized operators. The two representations and their associated operators are compared on problems ranging from 2K to 34K base pairs (KB). Edge-recombination crossover used in conjunction with several specialized operators is found to perform best in these experiments; these operators solved a 10KB sequence, consisting of 177 fragments, with no manual intervention. Natural building blocks in the problem are exploited at progressively higher levels through “macro-operators.” This significantly improves performance.