Assessing system agreement and instance difficulty in the lexical sample tasks of SENSEVAL-2

Authors:
Ted Pedersen
Affiliations:
University of Minnesota, Duluth, MN
Venue:
WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
Year:
2002

Citing 2
Cited 9

Assessing agreement on classification tasks: the kappa statistic

Computational Linguistics
Machine learning with lexical features: the Duluth approach to Senseval-2

SENSEVAL '01 The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems

Distinguishing easy and hard instances

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
A detailed comparison of WSD systems: an analysis of the system answers for the SENSEVAL-2 English all words task

Natural Language Engineering
Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Making sense of word sense variation

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Combining knowledge- and corpus-based word-sense-disambiguation methods

Journal of Artificial Intelligence Research
Paraphrase identification using machine learning techniques

ICNVS'10 Proceedings of the 12th international conference on Networking, VLSI and signal processing
Anveshan: a framework for analysis of multiple annotators' labeling behavior

LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
Paraphrase identification on the basis of supervised machine learning techniques

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Multiplicity and word sense: evaluating and learning from multiply labeled word sense annotations

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a comparative evaluation among the systems that participated in the Spanish and English lexical sample tasks of SENSEVAL-2. The focus is on pairwise comparisons among systems to assess the degree to which they agree, and on measuring the difficulty of the test instances included in these tasks.