Compressed suffix trees for repetitive texts

  • Authors:
  • Andrés Abeliuk;Gonzalo Navarro

  • Affiliations:
  • Department of Computer Science, University of Chile, Chile, and Instituto de Filosofía y Ciencias de la Complejidad (IFICC), University of Chile, Chile; Department of Computer Science, University of Chile, Chile

  • Venue:
  • SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
  • Year:
  • 2012

Abstract

We design a new compressed suffix tree specifically tailored to highly repetitive text collections. This is particularly useful for sequence analysis on large collections of genomes of closely related species. We build on an existing compressed suffix tree that applies statistical compression, and modify it so that it works on the grammar-compressed version of the longest common prefix array, whose differential version inherits much of the repetitiveness of the text.
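
As a rough illustration of the "differential version" of the longest common prefix (LCP) array mentioned above, the following minimal Python sketch builds a suffix array and LCP array naively and then takes consecutive differences. It is not the authors' data structure: the function names (`suffix_array`, `lcp_array`, `differential`, `restore`) are hypothetical, the construction is quadratic and suitable only for tiny inputs, and the grammar-compression step applied in the paper is not shown.

```python
def suffix_array(text):
    # Naive construction by sorting suffixes directly (illustration only).
    return sorted(range(len(text)), key=lambda i: text[i:])


def lcp_array(text, sa):
    # lcp[i] = length of the longest common prefix of the suffixes
    # starting at sa[i-1] and sa[i]; lcp[0] is defined as 0.
    lcp = [0] * len(sa)
    for i in range(1, len(sa)):
        a, b = sa[i - 1], sa[i]
        k = 0
        while a + k < len(text) and b + k < len(text) and text[a + k] == text[b + k]:
            k += 1
        lcp[i] = k
    return lcp


def differential(values):
    # d[0] = values[0]; d[i] = values[i] - values[i-1] for i > 0.
    if not values:
        return []
    return [values[0]] + [values[i] - values[i - 1] for i in range(1, len(values))]


def restore(diffs):
    # Inverse of differential(): running prefix sums.
    out, total = [], 0
    for d in diffs:
        total += d
        out.append(total)
    return out


if __name__ == "__main__":
    text = "banana"
    sa = suffix_array(text)
    lcp = lcp_array(text, sa)
    print(sa, lcp, differential(lcp))
```

The differential array loses no information, since prefix sums recover the original LCP values. The abstract's claim is that, on repetitive texts, this differential sequence contains many repeated substrings that a grammar compressor can exploit; the sketch only shows the encoding itself.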