Compiling simple context restrictions with nondeterministic automata

Authors:
Anssi Yli-Jyrä
Affiliations:
University of Helsinki, Finland
Venue:
FSMNLP '11 Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing
Year:
2011

Citing 11
Cited 0

Regular models of phonological rule systems

Computational Linguistics - Special issue on computational phonology
Compiling and using finite-state syntactic rules

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
Compiling a partition-based two-level formalism

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Parallel replacement in finite state calculus

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Logical specification of finite-state transductions for natural language processing

Logical specification of finite-state transductions for natural language processing
Describing syntax with star-free regular expressions

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Counter-Free Automata (M.I.T. research monograph no. 65)

Counter-Free Automata (M.I.T. research monograph no. 65)
Introduction to Automata Theory, Languages, and Computation (3rd Edition)

Introduction to Automata Theory, Languages, and Computation (3rd Edition)
A bimachine compiler for ranked tagging rules

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Regular Expressions and Predicate Logic in Finite-State Language Processing

Proceedings of the 2009 conference on Finite-State Methods and Natural Language Processing: Post-proceedings of the 7th International Workshop FSMNLP 2008
A method for compiling two-level rules with multiple contexts

SIGMORPHON '10 Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a non-conventional method for compiling (phonological or morpho-syntactic) context restriction (CR) constraints into non-deterministic automata in finite-state tools and surface parsing systems. The method reduces any CR into a simple one that constraints the occurrences of the empty string and represents right contexts with co-determististic states. In cases where a fully deterministic representation would be exponentially larger, this kind of inward determinism in contexts can bring benefits over various De Morgan approaches where full determinization is necessary. In the method, an accepted word gets a unique path that is a projection of a ladder-shaped structure in the context recognizer. This projection is computed in time that is polynomial to the number of context states. However, it may be difficult to take advantage of the method in a finite-state library that coerces intermediate results into canonical automata and whose intersection operation assumes deterministic automata.