Mapping lexical entries in a verbs database to WordNet senses

Authors:
Rebecca Green;Lisa Pearl;Bonnie J. Dorr;Philip Resnik
Affiliations:
University of Maryland, College Park, MD;University of Maryland, College Park, MD;University of Maryland, College Park, MD;University of Maryland, College Park, MD
Venue:
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Year:
2001

Citing 6
Cited 5

Assessing agreement on classification tasks: the kappa statistic

Computational Linguistics
Machine Learning

Machine Learning
Trainable methods for surface natural language generation

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Deriving verbal and compositional lexical aspect for NLP applications

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Improving data driven wordclass tagging by system combination

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Corpus-based lexical choice in natural language generation

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics

Construction of a Chinese–English Verb Lexicon for Machine Translation and Embedded Multilingual Applications

Machine Translation
Augmenting noun taxonomies by combining lexical similarity metrics

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A categorial variation database for English

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Integrating semantic frames from multiple sources

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Structure based semantic measurement for information filtering agents

AOW '07 Proceedings of the Third Australasian Workshop on Advances in Ontologies - Volume 85

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes automatic techniques for mapping 9611 entries in a database of English verbs to WordNet senses. The verbs were initially grouped into 491 classes based on syntactic features. Mapping these verbs into WordNet senses provides a resource that supports disambiguation in multilingual applications such as machine translation and cross-language information retrieval. Our techniques make use of (1) a training set of 1791 disambiguated entries, representing 1442 verb entries from 167 classes; (2) word sense probabilities, from frequency counts in a tagged corpus; (3) semantic similarity of WordNet senses for verbs within the same class; (4) probabilistic correlations between WordNet data and attributes of the verb classes. The best results achieved 72% precision and 58% recall, versus a lower bound of 62% precision and 38% recall for assigning the most frequently occurring WordNet sense, and an upper bound of 87% precision and 75% recall for human judgment.