Chinese and Japanese word segmentation using word-level and character-level information
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
In this paper, we present a hybrid method for word segmentation and part-of-speech (POS) tagging. The target languages are those in which word boundaries are ambiguous, such as Chinese and Japanese. In the method, word-based and character-based processing are combined, and word segmentation and POS tagging are performed simultaneously. Experimental results on multiple corpora show that the integrated method achieves high accuracy.
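
To make the idea of combining word-level and character-level processing concrete, the following is a minimal sketch, not the model described in the paper. It assumes a toy lexicon with hand-picked costs, a flat cost for single characters that are not in the lexicon, and a simple shortest-path search over the resulting lattice. The names `LEXICON`, `UNKNOWN_CHAR_COST`, `MAX_WORD_LEN`, and `segment_and_tag` are all hypothetical and chosen for illustration only.

```python
from math import inf
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: word -> (POS tag, cost). Values are made up.
LEXICON: Dict[str, Tuple[str, float]] = {
    "東京": ("NOUN", 1.0),
    "京都": ("NOUN", 1.0),
    "東京都": ("NOUN", 1.2),
    "に": ("PART", 0.5),
    "住む": ("VERB", 1.0),
}
UNKNOWN_CHAR_COST = 3.0  # assumed flat cost for a character-level node
MAX_WORD_LEN = 4         # assumed upper bound on word length


def segment_and_tag(text: str) -> List[Tuple[str, str]]:
    """Pick the cheapest path through a lattice that mixes word-level
    nodes (lexicon entries) and character-level nodes (single characters
    not in the lexicon). Segmentation and tagging are decided together,
    because each lattice node carries both a span and a tag."""
    n = len(text)
    # best[i] = (cost of best path ending at i, start of last node, tag)
    best: List[Tuple[float, int, str]] = [(inf, -1, "")] * (n + 1)
    best[0] = (0.0, -1, "")
    for end in range(1, n + 1):
        for start in range(max(0, end - MAX_WORD_LEN), end):
            piece = text[start:end]
            if piece in LEXICON:
                tag, cost = LEXICON[piece]          # word-level node
            elif end - start == 1:
                tag, cost = "UNK", UNKNOWN_CHAR_COST  # character-level node
            else:
                continue
            total = best[start][0] + cost
            if total < best[end][0]:
                best[end] = (total, start, tag)
    # Follow back-pointers from the end of the text to recover the path.
    result: List[Tuple[str, str]] = []
    pos = n
    while pos > 0:
        _, start, tag = best[pos]
        result.append((text[start:pos], tag))
        pos = start
    result.reverse()
    return result
```

Calling `segment_and_tag("東京都に住む")` under these assumed costs selects the lexicon words, while a character outside the lexicon falls back to a single-character node tagged `UNK`. A real system would learn the costs from annotated corpora rather than fix them by hand.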