Complexity, Two-Level Morphology and Finnish
COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
The Mathematical Theory of Context-Free Languages
The Mathematical Theory of Context-Free Languages
A stochastic finite-state word-segmentation algorithm for Chinese
Computational Linguistics
ARIES: A lexical platform for engineering Spanish processing tools
Natural Language Engineering
Multilingual text analysis for text-to-speech synthesis
Natural Language Engineering
Expansion of multi-word terms for indexing and retrieval using morphology and syntax
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
A stochastic finite-state word-segmentation algorithm for Chinese
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Machine-readable dictionaries in text-to-speech systems
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Issues in text-to-speech for French
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Weighted rational transductions and their application to human language processing
HLT '94 Proceedings of the workshop on Human Language Technology
Feature structures, unification and finite-state transducers
FSMNLP '09 Proceedings of the International Workshop on Finite State Methods in Natural Language Processing
Automatic conjugation and identification of regular and irregular verb neologisms in Spanish
CALC '10 Proceedings of the NAACL HLT 2010 Second Workshop on Computational Approaches to Linguistic Creativity
Onoma: a linguistically motivated conjugation system for spanish verbs
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Hi-index | 0.04 |
A finite transducer that processes Spanish inflectional and derivational morphology is presented. The system handles both generation and analysis of tens of millions inflected forms. Lexical and surface (orthographic) representations of the words are linked by a program that interprets a finite directed graph whose arcs are labelled by n-tuples of strings. Each of about 55,000 base forms requires at least one are in the graph. Representing the inflectional and derivational possibilities for these forms imposed an overhead of only about 3000 additional arcs, of which about 2500 represent (phonologically predictable) stem allomorphy, so that we pay a storage price of about 5% for compiling these forms offline. A simple interpreter for the resulting automaton processes several hundred words per second on a Sun4.