Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
Coping with ambiguity and unknown words through probabilistic models
Computational Linguistics - Special issue on using large corpora: II
A syntax-based part-of-speech analyser
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Part-of-speech tagging with neural networks
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Morphological analysis and synthesis by automated discovery and acquisition of linguistic rules
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Automatic rule induction for unknown-word guessing
Computational Linguistics
Unsupervised learning of part-of-speech guessing rules
Natural Language Engineering
Predicting part-of-speech information about unknown words using statistical methods
ACL '98 Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2
Automatic refinement of a POS tagger using a reliable parser and plain text corpora
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Learning part-of-speech guessing rules from lexicon: extension to non-concatenative operations
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
A second-order Hidden Markov Model for part-of-speech tagging
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Guessers for Finite-State Transducer Lexicons
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Automatic lexical acquisition from raw corpora: an application to Russian
MorphSlav '03 Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages
Inferring shallow-transfer machine translation rules from small parallel corpora
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
Words unknown to the lexicon present a substantial problem to part-of-speech tagging. In this paper we present a technique for fully unsupervised statistical acquisition of rules which guess possible parts-of-speech for unknown words. Three complementary sets of word-guessing rules are induced from the lexicon and a raw corpus: prefix morphological rules, suffix morphological rules and ending-guessing rules. The learning was performed on the Brown Corpus data and rule-sets, with a highly competitive performance, were produced and compared with the state-of-the-art.