Replacing suffix trees with enhanced suffix arrays

  • Authors:
  • Mohamed Ibrahim Abouelhoda;Stefan Kurtz;Enno Ohlebusch

  • Affiliations:
  • Faculty of Computer Science, University of Ulm, 89069 Ulm, Germany;Center for Bioinformatics, University of Hamburg, 20146 Hamburg, Germany;Faculty of Computer Science, University of Ulm, 89069 Ulm, Germany

  • Venue:
  • Journal of Discrete Algorithms - SPIRE 2002
  • Year:
  • 2004

Quantified Score

Hi-index 0.01

Visualization

Abstract

The suffix tree is one of the most important data structures in string processing and comparative genomics. However, the space consumption of the suffix tree is a bottleneck in large scale applications such as genome analysis. In this article, we will overcome-this obstacle. We will show how every algorithm that uses a suffix tree as data structure can systematically be replaced with an algorithm that uses an enhanced suffix array and solves the same problem in the same time complexity. The generic name enhanced suffix array stands for data structures consisting of the suffix array and additional tables. Our new algorithms are not only more space efficient than previous ones, but they are also faster and easier to implement.