A linear time algorithm for seeds computation

  • Authors:
  • Tomasz Kociumaka; Marcin Kubica; Jakub Radoszewski; Wojciech Rytter; Tomasz Waleń

  • Affiliations:
  • University of Warsaw, Warsaw, Poland; University of Warsaw, Warsaw, Poland; University of Warsaw, Warsaw, Poland; University of Warsaw, Warsaw, Poland, and Nicolaus Copernicus University, Toruń, Poland; University of Warsaw, Warsaw, Poland, and International Institute of Molecular and Cell Biology in Warsaw, Poland

  • Venue:
  • Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
  • Year:
  • 2012

Abstract

A seed in a word is a relaxed version of a period. We present a linear-time algorithm that computes a compact representation of all the seeds of a word and, in particular, the shortest seed. This solves an open problem stated in the survey by Smyth (2000) and improves upon the O(n log n) algorithm by Iliopoulos, Moore and Park (1996), which had stood for over 15 years. Our approach is based on combinatorial relations between seeds and a variant of the LZ-factorization (used here for the first time in the context of seeds).
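
To make the notion of a seed concrete, the following is a minimal brute-force sketch in Python. It is not the linear-time algorithm of the paper. It assumes the commonly used definition, which the abstract does not spell out: a seed of a word w is a factor s of w such that w is a factor of some longer word covered by occurrences of s. Equivalently, every position of w lies inside an occurrence of s, where occurrences may overhang the left or right end of w. The names `is_seed` and `shortest_seed` are hypothetical and used only for illustration.

```python
def is_seed(s: str, w: str) -> bool:
    """Brute-force check of the assumed seed definition (polynomial time)."""
    m, n = len(s), len(w)
    if m == 0 or m > n or s not in w:
        return False
    covered = [False] * n
    # Try every alignment of s against w, including alignments that
    # overhang the left end (i < 0) or the right end (i + m > n) of w.
    for i in range(-(m - 1), n):
        lo, hi = max(i, 0), min(i + m, n)
        # Only the part of s that overlaps w has to match.
        if w[lo:hi] == s[lo - i:hi - i]:
            for k in range(lo, hi):
                covered[k] = True
    return all(covered)


def shortest_seed(w: str) -> str:
    """Return the first shortest seed of w, scanning factors by length."""
    n = len(w)
    for m in range(1, n + 1):
        for i in range(n - m + 1):
            if is_seed(w[i:i + m], w):
                return w[i:i + m]
    return w  # reached only for the empty word
```

For example, under this definition "abc" is a seed of "abcabcab" (which also has period 3), and so is "bca", although "bca" is neither a prefix nor a suffix of the word; this illustrates in what sense a seed relaxes a period.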