Assessing agreement on classification tasks: the kappa statistic
Computational Linguistics
Machine Learning
Trainable methods for surface natural language generation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Deriving verbal and compositional lexical aspect for NLP applications
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Improving data driven wordclass tagging by system combination
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Corpus-based lexical choice in natural language generation
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Augmenting noun taxonomies by combining lexical similarity metrics
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A categorial variation database for English
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Integrating semantic frames from multiple sources
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Structure based semantic measurement for information filtering agents
AOW '07 Proceedings of the Third Australasian Workshop on Advances in Ontologies - Volume 85
Hi-index | 0.00 |
This paper describes automatic techniques for mapping 9611 entries in a database of English verbs to WordNet senses. The verbs were initially grouped into 491 classes based on syntactic features. Mapping these verbs into WordNet senses provides a resource that supports disambiguation in multilingual applications such as machine translation and cross-language information retrieval. Our techniques make use of (1) a training set of 1791 disambiguated entries, representing 1442 verb entries from 167 classes; (2) word sense probabilities, from frequency counts in a tagged corpus; (3) semantic similarity of WordNet senses for verbs within the same class; (4) probabilistic correlations between WordNet data and attributes of the verb classes. The best results achieved 72% precision and 58% recall, versus a lower bound of 62% precision and 38% recall for assigning the most frequently occurring WordNet sense, and an upper bound of 87% precision and 75% recall for human judgment.