Application of max-plus algebra to biological sequence comparisons

  • Authors:
  • J.-P. Comet

  • Affiliations:
  • LAMI, CNRS UMR 8042, University of Evry, Tour Evry 2, 523 Place des terrasses de l'agora, 91025 Évry Cedex, France

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2003

Quantified Score

Hi-index 5.23

Visualization

Abstract

The classical algorithms to align two biological sequences (Needleman and Wunsch and Smith and Waterman algorithms) can be seen as a sequence of elementary operations in (max, +) algebra: each line (viewed as a vector) of the dynamic programming table of the alignment algorithms can be deduced by a (max, +) multiplication of the previous line by a matrix. Taking into account the properties of these matrices there are only a finite number of nonproportional vectors. The use of this algebra allows one to imagine a faster equivalent algorithm. One can construct an automaton and afterwards skim through the sequence databank with this automaton in linear time. Unfortunately, the size of the automaton prevents using this approach for comparing global proteins. However, biologists frequently face the problem of comparing one short string against many others sequences. In that case this automaton version of dynamic programming results in a new algorithm which works faster than the classical algorithm.