Choice of grammatical word-class without global syntactic analysis: tagging words in the LOB Corpus.
Computers and the Humanities
Probabilistic models of short and long distance word dependencies in running text
HLT '89 Proceedings of the workshop on Speech and Natural Language
Coping with ambiguity and unknown words through probabilistic models
Computational Linguistics - Special issue on using large corpora: II
A practical part-of-speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Ambiguity resolution in a reductionistic parser
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Constraint grammar as a framework for parsing running text
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Annotating 200 million words: the Bank of English project
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Syntactic analysis of natural language using linguistic rules and corpus-based patterns
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Part of Speech Tagging from a Logical Point of View
LACL '01 Proceedings of the 4th International Conference on Logical Aspects of Computational Linguistics
Natural Language Engineering
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Tagging French: comparing a statistical and a constraint-based method
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
A syntax-based part-of-speech analyser
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Comparing a linguistic and a stochastic tagger
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Combining stochastic and rule-based methods for disambiguation in agglutinative languages
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Annotating topological fields and chunks: and revising POS tags at the same time
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
How to integrate linguistic information in FILES and generate feedback for grammar errors
STAR '01 Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources - Volume 15
A Korean Part-of-Speech Tagging System Using Resolution Rules for Individual Ambiguous Word
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Evaluation of the Syntactic Annotation in EPEC, the Reference Corpus for the Processing of Basque
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
The best of two worlds: cooperation of statistical and rule-based taggers for Czech
ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Evaluating GETARUNS parser with GREVAL test suite
ROMAND '04 Proceedings of the 3rd Workshop on RObust Methods in Analysis of Natural Language Data
Automatic Documentation and Mathematical Linguistics
A TENGRAM method based part-of-speech tagging of multi-category words in Hindi language
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
We discuss combining knowledge-based (or rule-based) and statistical part-of-speech taggers. We use two mature taggers, ENGCG and Xerox Tagger, to independently tag the same text and combine the results to produce a fully disambiguated text. In a 27000 word test sample taken from a previously unseen corpus we achieve 98.5% accuracy. This paper presents the data in detail. We describe the problems we encountered in the course of combining the two taggers and discuss the problem of evaluating taggers.