Ternary directed acyclic word graphs

  • Authors:
  • Satoru Miyamoto;Shunsuke Inenaga;Masayuki Takeda;Ayumi Shinohara

  • Affiliations:
  • Department of Informatics, Kyushu University 33, Fukuoka, Japan;Department of Informatics, Kyushu University 33, Fukuoka, Japan and PRESTO, Japan Science and Technology Corporation;Department of Informatics, Kyushu University 33, Fukuoka, Japan and PRESTO, Japan Science and Technology Corporation;Department of Informatics, Kyushu University 33, Fukuoka, Japan and PRESTO, Japan Science and Technology Corporation

  • Venue:
  • CIAA'03 Proceedings of the 8th international conference on Implementation and application of automata
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given a set S of strings, a DFA accepting S offers a very time-efficient solution to the pattern matching problem over S. The key is how to implement such a DFA in the trade-off between time and space, and especially the choice of how to implement the transitions of each state is critical. Bentley and Sedgewick proposed an effective tree structure called ternary trees. The idea of ternary trees is to 'implant' the process of binary search for transitions into the structure of the trees themselves. This way the process of binary search becomes visible, and the implementation of the trees becomes quite easy. The directed acyclic word graph (DAWG) of a string w is the smallest DFA that accepts all suffixes of w, and requires only linear space. We apply the scheme of ternary trees to DAWGs, introducing a new data structure named ternary DAWGs (TDAWGs). We perform some experiments that show the efficiency of TDAWGs, compared to DAWGs in which transitions are implemented by tables and linked lists.