C4.5: programs for machine learning
C4.5: programs for machine learning
Natural language parsing as statistical pattern recognition
Natural language parsing as statistical pattern recognition
Tree Induction for Probability-Based Ranking
Machine Learning
TnT: a statistical part-of-speech tagger
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Tagging inflective languages: prediction of morphological categories for a rich, structured tagset
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Probabilistic tagging with feature structures
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Serial combination of rules and statistics: a case study in Czech tagging
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Feature-rich part-of-speech tagging with a cyclic dependency network
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
The best of two worlds: cooperation of statistical and rule-based taggers for Czech
ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Part-of-speech tagging of Northern Sotho: disambiguating polysemous function words
AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
Tagging Urdu text with parts of speech: a tagger comparison
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Entity-based local coherence modelling using topological fields
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Towards robust multi-tool tagging. An OWL/DL-based approach
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
SemEval-2010 task 1: Coreference resolution in multiple languages
SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
LIMSI's statistical translation systems for WMT'10
WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Vs and OOVs: two problems for translation between German and English
WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Second-order HMM for event extraction from short message
NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
Improving reordering with linguistically informed bilingual n-grams
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A logistic regression model of determiner omission in PPs
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The Karlsruhe Institute of Technology translation systems for the WMT 2011
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Combining polish morphosyntactic taggers
SIIS'11 Proceedings of the 2011 international conference on Security and Intelligent Information Systems
Data point selection for self-training
SPMRL '11 Proceedings of the Second Workshop on Statistical Parsing of Morphologically Rich Languages
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
The Karlsruhe institute of technology translation systems for the WMT 2012
WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Rule-Based morphological tagger for an inflectional language
COST'11 Proceedings of the 2011 international conference on Cognitive Behavioural Systems
Generation of compound words in statistical machine translation into compounding languages
Computational Linguistics
Hi-index | 0.00 |
We present a HMM part-of-speech tagging method which is particularly suited for POS tagsets with a large number of fine-grained tags. It is based on three ideas: (1) splitting of the POS tags into attribute vectors and decomposition of the contextual POS probabilities of the HMM into a product of attribute probabilities, (2) estimation of the contextual probabilities with decision trees, and (3) use of high-order HMMs. In experiments on German and Czech data, our tagger outperformed state-of-the-art POS taggers.