Grammatical category disambiguation by statistical optimization
Computational Linguistics
Morphological parsing and the lexicon
Lexical representation and process
Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Regular models of phonological rule systems
Computational Linguistics - Special issue on computational phonology
Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text
Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text
A stochastic parts program and noun phrase parser for unrestricted text
ANLC '88 Proceedings of the second conference on Applied natural language processing
Tagging and morphological disambiguation of Turkish text
ANLC '94 Proceedings of the fourth conference on Applied natural language processing
A practical part-of-speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Ambiguity resolution in a reductionistic parser
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
A syntax-based part-of-speech analyser
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Tagging English by path voting constraints
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Combining stochastic and rule-based methods for disambiguation in agglutinative languages
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Statistical morphological disambiguation for agglutinative languages
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Learning morphological disambiguation rules for Turkish
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Turkish Language Resources: Morphological Parser, Morphological Disambiguator and Web Corpus
GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Algorithms for the coalitional manipulation problem
Artificial Intelligence
Morphological Disambiguation of Turkish Text with Perceptron Algorithm
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
The best of two worlds: cooperation of statistical and rule-based taggers for Czech
ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Implementing voting constraints with finite state transducers
FSMNLP '09 Proceedings of the International Workshop on Finite State Methods in Natural Language Processing
Resources for Turkish morphological processing
Language Resources and Evaluation
Resources for Turkish morphological processing
Language Resources and Evaluation
Pronunciation disambiguation in turkish
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
Morphological annotation of a corpus with a collaborative multiplayer game
CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Morpheme segmentation in the METU-Sabancı Turkish treebank
LAW VI '12 Proceedings of the Sixth Linguistic Annotation Workshop
Hi-index | 0.00 |
We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses, and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence, and hence relieves the rule developer from worrying about potentially conflicting rule sequencing. Our results for disambiguating Turkish indicate that using about 500 constraint rules and some additional simple statistics, we can attain a recall of 95--96% and a precision of 94--95% with about 1.01 parses per token. Our system is implemented in Prolog and we are currently investigating an efficient implementation based on finite state transducers.