Reusing an ontology to generate numeral classifiers
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Classifier assignment by corpus-based approach
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Stemming Indonesian: A confix-stripping approach
ACM Transactions on Asian Language Information Processing (TALIP)
Hi-index | 0.00 |
We examine the capacity of Web and corpus frequency methods to predict preferred count classifiers for nouns in Malay. The observed F-score for the Web model of 0.671 considerably outperformed corpus-based frequency and machine learning models. We expect that this is a fruitful extension for Web-as-corpus approaches to lexicons in languages other than English, but further research is required in other South-East and East Asian languages.