An index structure for spaced seed search

  • Authors: Taku Onodera; Tetsuo Shibuya
  • Affiliations: Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, Japan (both authors)
  • Venue: ISAAC'11, Proceedings of the 22nd International Conference on Algorithms and Computation
  • Year: 2011

Abstract

In this paper, we introduce an index structure for texts that supports fast search for patterns with "don't care" symbols at predetermined positions. The data structure is a generalization of the suffix array and has many applications, especially in computational biology. We propose three algorithms to construct the index. Two of them are based on a variant of radix sort, but each uses a different type of referential information to sort suffixes by multiple characters at a time. The third handles the case where the "don't care" symbols appear periodically in patterns, and it can be combined with either of the other two.
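
To make the idea of such an index concrete, the following is a minimal sketch in Python. It is not one of the three construction algorithms from the paper: it builds the index by naive comparison sorting, and it stores the masked keys explicitly for simplicity, which a real suffix-array-style index would avoid. The mask notation (`1` for a position that must match, `0` for a "don't care"), the `*` placeholder in the pattern, and the function names `build_spaced_index` and `search` are all assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


def build_spaced_index(text, mask):
    """Sort text positions by the characters at the mask's '1' positions.

    Assumption for this sketch: only positions where the whole mask fits
    inside the text are indexed.
    """
    care = [i for i, bit in enumerate(mask) if bit == "1"]
    starts = range(len(text) - len(mask) + 1)
    # Naive construction: build each masked key and sort (key, position) pairs.
    keyed = sorted(("".join(text[s + i] for i in care), s) for s in starts)
    keys = [k for k, _ in keyed]
    positions = [s for _, s in keyed]
    return care, keys, positions


def search(index, pattern):
    """Return all text positions where pattern matches at the care positions."""
    care, keys, positions = index
    # Characters at "don't care" positions of the pattern are ignored.
    key = "".join(pattern[i] for i in care)
    lo = bisect_left(keys, key)
    hi = bisect_right(keys, key)
    return sorted(positions[lo:hi])


index = build_spaced_index("ACGTACCTAGGT", "1101")
print(search(index, "AC*T"))  # [0, 4]
```

Because the positions are sorted by their masked keys, all occurrences of a pattern form one contiguous range, which binary search locates just as in an ordinary suffix array. Here the pattern `AC*T` matches both `ACGT` at position 0 and `ACCT` at position 4.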