On Constructing Suffix Arrays in External Memory

  • Authors:
  • Andreas Crauser;Paolo Ferragina

  • Affiliations:
  • -;-

  • Venue:
  • ESA '99 Proceedings of the 7th Annual European Symposium on Algorithms
  • Year:
  • 1999

Quantified Score

Hi-index 0.01

Visualization

Abstract

The construction of full-text indexes on very large text collections is nowadays a hot problem. The suffix array [16] is one of the most attractive full-text indexing data structures due to its simplicity, space efficiency and powerful/fast search operations supported. In this paper we analyze theoretically and experimentally, the I/O-complexity and the working space of six algorithms for constructing large suffix arrays. Additionally, we design a new external-memory algorithm that follows the basic philosophy underlying the algorithm in [13] but in a significantly different manner, thus combining its good practical qualities with efficient worstcase performances. At the best of our knowledge, this is the first study which provides a wide spectrum of possible approaches to the construction of suffix arrays in external memory, and thus it should be helpful to anyone who is interested in building full-text indexes on very large text collections.