Locating All Tandem Repeat Families in a Sequence

  • Authors:
  • Donald Adjeroh;Jianan Feng

  • Affiliations:
  • West Virginia University;West Virginia University

  • Venue:
  • CSB '04 Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a new data structure called the BSCP (block sorted common prefix), and its tree representation, called the BSCP tree. We also introduce the notion of PTR family 驴 a biologically motivated description and representation of the tandem repetitions in a sequence. The PTR family implicitly encodes each distinct primitive tandem repeat in the sequence as its part. Based on the BSCP tree, we describe a method to locate all the primitive tandem repeat families in an input sequence T. The proposed method requires average space and time complexity in O(u), where u = |T|.