The SCP and Compressed Domain Analysis of Biological Sequences

  • Authors:
  • Donald Adjeroh; Jianan Feng

  • Venue:
  • CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
  • Year:
  • 2003

Abstract

We introduce the SCP, the sorted common prefix, and study some of its properties. Based on the internal representations used by a class of new compression schemes, we show how the SCP table can be constructed using O(u + |\Sigma| \kappa_{\max}) comparisons on average, and O(u |\Sigma|) in the worst case, where u is the size of the sequence, |\Sigma| is the number of symbols, and \kappa_{\max} is the maximum SCP value. We describe one application of the SCP to the problem of anchor points in multiple sequence alignment.