Compressed automata for dictionary matching

  • Authors:
  • I Tomohiro;Takaaki Nishimoto;Shunsuke Inenaga;Hideo Bannai;Masayuki Takeda

  • Affiliations:
  • Department of Informatics, Kyushu University, Japan,Japan Society for the Promotion of Science (JSPS), Japan;Department of Informatics, Kyushu University, Japan;Department of Informatics, Kyushu University, Japan;Department of Informatics, Kyushu University, Japan;Department of Informatics, Kyushu University, Japan

  • Venue:
  • CIAA'13 Proceedings of the 18th international conference on Implementation and Application of Automata
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A variant of the dictionary matching problem is addressed where the dictionary is given in an SLP-compressed form. An Aho-Corasick automata-based algorithm is presented which pre-processes the compressed dictionary $\mathcal{D}$ in O(n4logn) time using O(n2logN) space and recognizes all occurrences of the patterns in $\mathcal{D}$ in amortized O(h+m) running time per character, where n and N are, respectively, the compressed and uncompressed sizes of $\mathcal{D}$, and h is the height of $\mathcal{D}$, and m is the number of patterns in the dictionary.