Evaluating sense disambiguation across diverse parameter spaces

Authors:
David Yarowsky;Radu Florian
Affiliations:
Department of Computer Science and Center for Language and Speech Processing, Johns Hopkins University, MD 21218, USA e-mail: yarowsky@cs.jhu.edu, rflorian@cs.jhu.edu;Department of Computer Science and Center for Language and Speech Processing, Johns Hopkins University, MD 21218, USA e-mail: yarowsky@cs.jhu.edu, rflorian@cs.jhu.edu
Venue:
Natural Language Engineering
Year:
2002

Citing 13
Cited 44

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Computational Linguistics
Introduction to the special issue on the web as corpus

Computational Linguistics - Special issue on web as corpus
The interaction of knowledge sources in word sense disambiguation

Computational Linguistics
Combining Classifiers for word sense disambiguation

Natural Language Engineering
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Dynamic nonlocal language modeling via hierarchical topic-based adaptation

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
A decision tree of bigrams is an accurate predictor of word sense

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Rule writing or annotation: cost-efficient resource usage for base noun phrase chunking

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Corpus-based statistical sense resolution

HLT '93 Proceedings of the workshop on Human Language Technology
One sense per collocation

HLT '93 Proceedings of the workshop on Human Language Technology
Coaxing confidences from an old friend: probabilistic classifications from transformation rule lists

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Augmented mixture models for lexical disambiguation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
SENSEVAL-2: overview

SENSEVAL '01 The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems

Word sense disambiguation with pattern learning and automatic feature selection

Natural Language Engineering
WASPBENCH: a lexicographer's workbench supporting state-of-the-art word sense disambiguation

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Modeling consensus: classifier combination for word sense disambiguation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Bootstrapping toponym classifiers

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
A detailed comparison of WSD systems: an analysis of the system answers for the SENSEVAL-2 English all words task

Natural Language Engineering
Finding predominant word senses in untagged text

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
A kernel PCA method for superior word sense disambiguation

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Word sense disambiguation vs. statistical machine translation

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Domain kernels for word sense disambiguation

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Ensemble methods for unsupervised WSD

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Trajectory based word sense disambiguation

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Semi-supervised training of a kernel PCA-based model for word sense disambiguation

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Domain-specific sense distributions and predominant sense acquisition

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Dependency-Based Construction of Semantic Space Models

Computational Linguistics
Aligning features with sense distinction dimensions

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Word sense disambiguation: A survey

ACM Computing Surveys (CSUR)
Text Categorization for Improved Priors of Word Meaning

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Co-dispersion: a windowless approach to lexical association

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
An evaluation of a lexicographer's workbench: building lexicons for machine translation

EAMT '03 Proceedings of the 7th International EAMT workshop on MT and other Language Technology Tools, Improving MT through other Language Technology Tools: Resources and Tools for Building MT
Estimating and exploiting the entropy of sense distributions

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
CITYU-HIF: WSD with human-informed feature preference

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
KU: word sense disambiguation by substitution

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
OE: WSD using optimal ensembling (OE) method

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
On the use of automatically acquired examples for all-nouns word sense disambiguation

Journal of Artificial Intelligence Research
Graph connectivity measures for unsupervised word sense disambiguation

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
From predicting predominant senses to local context for word sense disambiguation

STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
A comparison of windowless and window-based computational association measures as predictors of syntagmatic human associations

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
The noisy channel model for unsupervised word sense disambiguation

Computational Linguistics
Local context selection for aligning sentences in parallel corpora

CONTEXT'07 Proceedings of the 6th international and interdisciplinary conference on Modeling and using context
Context-based sentence alignment in parallel corpora

CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
An evaluation of a lexicographer's workbench incorporating word sense disambiguation

CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
BabelNet: building a very large multilingual semantic network

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
WSD as a distributed constraint optimization problem

ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Assessing the contribution of shallow and deep knowledge sources for word sense disambiguation

Language Resources and Evaluation
What's in a preposition?: dimensions of sense disambiguation for an interesting word class

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Incorporating coreference resolution into word sense disambiguation

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Defining classifier regions for WSD ensembles using word space features

MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
Building an optimal WSD ensemble using per-word selection of best system

CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
Robust utilization of context in word sense disambiguation

CONTEXT'05 Proceedings of the 5th international conference on Modeling and Using Context
Unsupervised similarity-based word sense disambiguation using context vectors and sentential word importance

ACM Transactions on Speech and Language Processing (TSLP)
A semi-supervised approach for key-synset extraction to be used in word sense disambiguation

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network

Artificial Intelligence
Building instance knowledge network for word sense disambiguation

ACSC '11 Proceedings of the Thirty-Fourth Australasian Computer Science Conference - Volume 113

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper presents a comprehensive empirical exploration and evaluation of a diverse range of data characteristics which influence word sense disambiguation performance. It focuses on a set of six core supervised algorithms, including three variants of Bayesian classifiers, a cosine model, non-hierarchical decision lists, and an extension of the transformation-based learning model. Performance is investigated in detail with respect to the following parameters: (a) target language (English, Spanish, Swedish and Basque); (b) part of speech; (c) sense granularity; (d) inclusion and exclusion of major feature classes; (e) variable context width (further broken down by part-of-speech of keyword); (f) number of training examples; (g) baseline probability of the most likely sense; (h) sense distributional entropy; (i) number of senses per keyword; (j) divergence between training and test data; (k) degree of (artificially introduced) noise in the training data; (l) the effectiveness of an algorithm's confidence rankings; and (m) a full keyword breakdown of the performance of each algorithm. The paper concludes with a brief analysis of similarities, differences, strengths and weaknesses of the algorithms and a hierarchical clustering of these algorithms based on agreement of sense classification behavior. Collectively, the paper constitutes the most comprehensive survey of evaluation measures and tests yet applied to sense disambiguation algorithms. And it does so over a diverse range of supervised algorithms, languages and parameter spaces in single unified experimental framework.