A space-efficient algorithm for sequence alignment with inversions and reversals

  • Authors:
  • Zhi-Zhong Chen;Yong Gao;Guohui Lin;Robert Niewiadomski;Yang Wang;Junfeng Wu

  • Affiliations:
  • Mathematical Sciences, Tokyo Denki University, Hatoyama, Saitama 350-0394, Japan;Department of Computing Science, Faculty of Sciences, University of Alberta, 2-21 Athabasca Hall, Edmonton, Alberta, Canada T6G 2E8;Department of Computing Science, Faculty of Sciences, University of Alberta, 2-21 Athabasca Hall, Edmonton, Alberta, Canada T6G 2E8;Department of Computing Science, Faculty of Sciences, University of Alberta, 2-21 Athabasca Hall, Edmonton, Alberta, Canada T6G 2E8;Department of Computing Science, Faculty of Sciences, University of Alberta, 2-21 Athabasca Hall, Edmonton, Alberta, Canada T6G 2E8;Department of Computing Science, Faculty of Sciences, University of Alberta, 2-21 Athabasca Hall, Edmonton, Alberta, Canada T6G 2E8

  • Venue:
  • Theoretical Computer Science - Special papers from: COCOON 2003
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

A dynamic programming algorithm to find an optimal alignment for a pair of DNA sequences has been described by Schöniger and Waterman. The alignments use not only substitutions, insertions, and deletions of single nucleotides, but also inversions, which are the reversed complements, of substrings of the sequences. With the restriction that the inversions are pairwise non-intersecting, their proposed algorithm runs in O(n2m2) time and consumes O(n2m2) space, where n and m are the lengths of the input sequences, respectively. We develop a space-efficient algorithm to compute such an optimal alignment which consumes only O(nm) space within the same amount of time. Our algorithm enables the computation for a pair of DNA sequences of length up to 10,000 to be carried out on an ordinary desktop computer. Simulation study is conducted to verify some biological facts about gene shuffling across species.