A Fast Algorithm on Average for All-Against-All Sequence Matching

  • Authors:
  • Ricardo A. Baeza-Yates; Gaston H. Gonnet

  • Venue:
  • SPIRE '99 Proceedings of the String Processing and Information Retrieval Symposium & International Workshop on Groupware
  • Year:
  • 1999

Abstract

In this paper we present an algorithm which attempts to align pairs of subsequences from a database of genetic sequences. The algorithm simulates the classical dynamic programming alignment algorithm over a suffix array of the database. We provide a detailed average case analysis which shows that the running time of the algorithm is sub-quadratic with respect to the database size. A similar algorithm solves the approximate string matching problem in sub-linear average time.
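
The idea of running the dynamic programming recurrence over a suffix array, rather than over the raw text, can be illustrated with a small sketch. This is not the authors' algorithm, only a minimal illustration under assumptions: the suffix array is built by naive sorting, the cost model is unit-cost edit distance, and the names `build_suffix_array`, `next_column`, and `approx_starts` are hypothetical. Suffixes that share a prefix occupy a contiguous interval of the suffix array, so one DP column is computed per shared prefix instead of once per text position, and an interval is abandoned as soon as every entry of its column exceeds the error bound k.

```python
def build_suffix_array(text):
    # Naive construction by sorting suffixes; fine for a small illustration.
    return sorted(range(len(text)), key=lambda i: text[i:])


def next_column(col, pattern, c):
    # One step of the classical edit-distance recurrence: extend the text
    # prefix by character c and update distances to every pattern prefix.
    new = [col[0] + 1]
    for i in range(1, len(col)):
        substitute = col[i - 1] + (pattern[i - 1] != c)
        new.append(min(substitute, col[i] + 1, new[i - 1] + 1))
    return new


def approx_starts(text, pattern, k):
    # Return the sorted start positions i such that some prefix of text[i:]
    # is within edit distance k of pattern (assumes 0 <= k < len(pattern)).
    sa = build_suffix_array(text)
    n, m = len(text), len(pattern)
    hits = []
    # Each stack entry is a suffix-array interval whose suffixes share their
    # first `depth` characters, plus the DP column for that shared prefix.
    stack = [(0, n, 0, list(range(m + 1)))]
    while stack:
        lo, hi, depth, col = stack.pop()
        j = lo
        while j < hi:
            if sa[j] + depth >= n:
                j += 1  # this suffix is exhausted at the current depth
                continue
            c = text[sa[j] + depth]
            end = j
            # Suffixes are sorted, so equal next characters are contiguous.
            while end < hi and sa[end] + depth < n and text[sa[end] + depth] == c:
                end += 1
            new = next_column(col, pattern, c)
            if new[m] <= k:
                hits.extend(sa[j:end])  # every suffix in the group matches
            elif min(new) <= k:
                stack.append((j, end, depth + 1, new))  # still viable
            # otherwise prune: no extension can get back within k errors
            j = end
    return sorted(hits)
```

For example, `approx_starts("banana", "ana", 0)` returns `[1, 3]`, the two exact occurrences, and with `k = 1` it returns `[0, 1, 2, 3, 4]`. Because the column minimum never decreases as the prefix grows, the search depth in this sketch is bounded by the pattern length plus k.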