Multiple alignment by sequence annealing

  • Authors:
  • Ariel S. Schwartz;Lior Pachter

  • Affiliations:
  • EECS, Computer Science Division, University of California Berkeley, CA 94720, USA;Department of Mathematics, University of California Berkeley, CA 94720, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2007

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: We introduce a novel approach to multiple alignment that is based on an algorithm for rapidly checking whether single matches are consistent with a partial multiple alignment. This leads to a sequence annealing algorithm, which is an incremental method for building multiple sequence alignments one match at a time. Our approach improves significantly on the standard progressive alignment approach to multiple alignment. Results: The sequence annealing algorithm performs well on benchmark test sets of protein sequences. It is not only sensitive, but also specific, drastically reducing the number of incorrectly aligned residues in comparison to other programs. The method allows for adjustment of the sensitivity/specificity tradeoff and can be used to reliably identify homologous regions among protein sequences. Availability: An implementation of the sequence annealing algorithm is available at http://bio.math.berkeley.edu/amap/ Contact: sariel@cs.berkeley.edu