An extension of the burrows wheeler transform and applications to sequence comparison and data compression

  • Authors:
  • Sabrina Mantaci;Antonio Restivo;G. Rosone;Marinella Sciortino

  • Affiliations:
  • Dipartimento di Matematica ed Application, University of Palermo, Palermo, Italy;Dipartimento di Matematica ed Application, University of Palermo, Palermo, Italy;Dipartimento di Matematica ed Application, University of Palermo, Palermo, Italy;Dipartimento di Matematica ed Application, University of Palermo, Palermo, Italy

  • Venue:
  • CPM'05 Proceedings of the 16th annual conference on Combinatorial Pattern Matching
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a generalization of the Burrows-Wheeler Transform (BWT) that can be applied to a multiset of words. The extended transformation, denoted by E, is reversible, but, differently from BWT, it is also surjective. The E transformation allows to give a definition of distance between two sequences, that we apply here to the problem of the whole mitochondrial genome phylogeny. Moreover we give some consideration about compressing a set of words by using the E transformation as preprocessing.