Optimal lightweight construction of suffix arrays for constant alphabets

  • Authors:
  • Ge Nong;Sen Zhang

  • Affiliations:
  • Computer Science Department, Sun Yat-Sen University, GuangZhou, PRC;Department of Mathematics, Computer Science and Statistics, SUNY College at Oneonta, NY

  • Venue:
  • WADS'07 Proceedings of the 10th international conference on Algorithms and Data Structures
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article presents our divide-and-conquer optimal algorithms for lightweight suffix array construction for constant alphabets. These algorithms can efficiently compute the suffix array of a size-n text T with an alphabet Σ using O(n log Σ) time and (l(T) + |Σ|log n + O(1))-bit working space (excluding the space for the output suffix array), where Σ is an integer or constant alphabet, and l(T) is the length of T measured in bits. For popular applications in practice with n ≤ 232 and |Σ| ≤ 256, these results translate into O(n) time and a total space of 5n + O(1) bytes, which are the optimal time and space complexities for lightweight suffix array construction.