Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Building probabilistic models for natural language
Building probabilistic models for natural language
Shallow parsing using specialized hmms
The Journal of Machine Learning Research
Chinese named entity recognition using lexicalized HMMs
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Chinese word segmentation as morpheme-based lexical chunking
Information Sciences: an International Journal
Dependency-Based Chinese-English Statistical Machine Translation
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
OpinionMiner: a novel machine learning system for web opinion mining and extraction
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Hi-index | 0.00 |
Since most previous works for HMM-based tagging consider only part-of-speech information in contexts, their models cannot utilize lexical information which is crucial for resolving some morphological ambiguity. In this paper we introduce uniformly lexicalized HMMs for part-of-speech tagging in both English and Korean. The lexicalized models use a simplified back-off smoothing technique to overcome data sparseness. In experiments, lexicalized models achieve higher accuracy than non-lexicalized models and the back-off smoothing method mitigates data sparseness better than simple smoothing methods.