Memory-based learning: using similarity for smoothing
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Restructuring tagged corpora with morpheme adjustment rules
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Hi-index | 0.01 |
Today, many kinds of tagged corpora are available for research use. Often a different morphological system is used in each corpus. This makes it difficult to merge different types of morphological information, since conversion between different systems is complex and necessitates a understanding of both systems. This paper describes a method of converting morphological information between two different systems by using lexicalized and general conversion. The difference between lexicalized and general conversion is the existence or absence of a lexicalized condition. Which conversion is applied depends on the frequency of segments.