On the implementation of compact DAWG's

  • Authors:
  • Jan Holub;Maxime Crochemore

  • Affiliations:
  • Department of Computer Science and Engineering, Czech Technical University, Czech Republic;Gaspard-Monge Institute, University of Marne-la-Vallée, France

  • Venue:
  • CIAA'02 Proceedings of the 7th international conference on Implementation and application of automata
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

There are several data structures that allow searching for a pattern P in a preprocessed text T in time dependent just on the length of P. In this paper we present an implementation of CDAWG's-- Compact Direct Acyclic Word Graphs. While the previous implementations of CDAWG's required from 7n to 23n bytes of memory space, ours achieves 1.7n to 5n for a text T of length n. The implementation is suitable for large data files, since it minimizes the number of disk accesses. If disk accesses are not to be optimized, space requirements can be further decreased.