Dynamic Fully-Compressed Suffix Trees

  • Authors:
  • Luís M. Russo;Gonzalo Navarro;Arlindo L. Oliveira

  • Affiliations:
  • INESC-ID / IST, Lisboa, Portugal 1000 and Dept. of Computer Science, University of Lisbon, Portugal;Dept. of Computer Science, University of Chile,;INESC-ID / IST, Lisboa, Portugal 1000

  • Venue:
  • CPM '08 Proceedings of the 19th annual symposium on Combinatorial Pattern Matching
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Suffix trees are by far the most important data structure in stringology, with myriads of applications in fields like bioinformatics, data compression and information retrieval. Classical representations of suffix trees require O(nlogn) bits of space, for a string of size n. This is considerably more than the nlog2茂戮驴bits needed for the string itself, where 茂戮驴is the alphabet size. The size of suffix trees has been a barrier to their wider adoption in practice. A recent so-called fully-compressed suffix tree (FCST) requires asymptotically only the space of the text entropy. FCSTs, however, have the disadvantage of being static, not supporting updates to the text. In this paper we show how to support dynamic FCSTs within the same optimal space of the static version and executing all the operations in polylogarithmic time. In particular, we are able to build the suffix tree within optimal space.