Finding LPRs in DNA Sequence Based on a New Index — SUA

  • Authors:
  • Di Wang;Guoren Wang;Qingquan Wu;Baichen Chen

  • Affiliations:
  • Northeastern University;Northeastern University;Northeastern University and Shanghai Baosight Ltd.;Northeastern University

  • Venue:
  • BIBE '05 Proceedings of the Fifth IEEE Symposium on Bioinformatics and Bioengineering
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a new concept of repetitions, the Largest Pattern Repetition (the LPR) and a concept of pattern unit. A lightweight index structure, namely, the Succeeding Unit Array (the SUA) is designed based on pattern unit. The SUA decreases the space consumption efficiently and solves the space bottleneck in the search of repetitions. On the SUA all the atomic patterns which constitute the LPRs can be detected and the LPRs can be identified by connecting the same patterns. The theoretical analysis and experimental results show that both space and time complexity of the algorithms is O(n).