Treatment of ε-moves in subset construction

  • Authors:
  • Gertjan van Noord

  • Affiliations:
  • University of Groningen, Netherlands

  • Venue:
  • FSMNLP '09 Proceedings of the International Workshop on Finite State Methods in Natural Language Processing
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper discusses the problem of determinising finite-state automata containing large numbers of ε-moves. Experiments with finite-state approximations of natural language grammars often give rise to very large automata with a very large number of ε-moves. The paper identifies three subset construction algorithms which treat ε-moves. A number of experiments has been performed which indicate that the algorithms differ considerably in practice. Furthermore, the experiments suggest that the average number of ε-moves per state can be used to predict which algorithm is likely to perform best for a given input automaton.