The Minimum DAWG for All Suffixes of a String and Its Applications

  • Authors:
  • Shunsuke Inenaga;Masayuki Takeda;Ayumi Shinohara;Hiromasa Hoshino;Setsuo Arikawa

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • CPM '02 Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

For a string w over an alphabet 驴, we consider a composite data structure called the all-suffixes directed acyclic word graph (ASDAWG). ASDAWG(w) has |w| + 1 initial nodes, and the dag induced by all reachable nodes from the k-th initial node conforms with DAWG(w[k :]), where w[k :] denotes the k-th suffix of w. We prove that the size of the minimum ASDAWG(w) (MASDAWG(w)) is 驴(|w|) for |驴| = 1, and is 驴(|w|2) for |驴| 驴 2. Moreover, we introduce an on-line algorithm which directly constructs MASDAWG(w) for given w, whose running time is linear with respect to its size. We also demonstrate some application problems, beginning-sensitive pattern matching, regionsensitive pattern matching, and VLDC-pattern matching, for which AS-DAWGs are useful.