Evaluating Word Sense Induction and Disambiguation Methods

Authors:
Ioannis P. Klapaftis;Suresh Manandhar
Affiliations:
Microsoft Corporation, Redmond, USA;Department of Computer Science, University of York, York, UK
Venue:
Language Resources and Evaluation
Year:
2013

Citing 31
Cited 0

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Extending a Lexical Ontology by a Combination of Distributional Semantics Signatures

EKAW '02 Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web
Latent dirichlet allocation

The Journal of Machine Learning Research
Automatic word sense discrimination

Computational Linguistics - Special issue on word sense disambiguation
Entity-based cross-document coreferencing using the Vector Space Model

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
The Berkeley FrameNet Project

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Discovering corpus-specific word senses

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Concept discovery from text

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Bidirectional inference with the easiest-first strategy for tagging sequence data

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Automatic cluster stopping with criterion functions and the gap statistic

NAACL-Demonstrations '06 Proceedings of the 2006 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume: demonstrations
A comparison of extrinsic clustering evaluation metrics based on formal constraints

Information Retrieval
Bayesian word sense induction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Two graph-based algorithms for state-of-the-art WSD

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
SenseClusters: finding clusters that represent word senses

HLT-NAACL--Demonstrations '04 Demonstration Papers at HLT-NAACL 2004
OntoNotes: the 90% solution

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Semeval-2007 task 02: evaluating word sense induction and discrimination systems

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
I2R: three systems for word sense discrimination, Chinese word sense disambiguation, and English word sense disambiguation

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
UBC-AS: a graph based unsupervised system for induction and classification

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
UMND2: SenseClusters applied to the sense induction task of Senseval-4

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
UPV-SI: word sense induction using self term expansion

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems

TextGraphs-1 Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
Evaluating and optimizing the parameters of an unsupervised graph-based WSD algorithm

TextGraphs-1 Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
The role of named entities in web people search

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
SemEval-2010 task 14: Word sense induction & disambiguation

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
KCDC: Word sense induction by using grammatical dependencies and sentence phrase structure

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
UoY: Graphs of unambiguous vertices for word sense induction and disambiguation

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
HERMIT: Flexible clustering for the SemEval-2 WSI task

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Duluth-WSI: SenseClusters applied to the sense induction task of SemEval-2

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
KSU KDD: Word sense induction by clustering in topic space

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Inducing word senses to improve web search result clustering

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Word sense induction & disambiguation using hierarchical random graphs

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Word Sense Induction (WSI) is the task of identifying the different uses (senses) of a target word in a given text in an unsupervised manner, i.e. without relying on any external resources such as dictionaries or sense-tagged data. This paper presents a thorough description of the SemEval-2010 WSI task and a new evaluation setting for sense induction methods. Our contributions are two-fold: firstly, we provide a detailed analysis of the Semeval-2010 WSI task evaluation results and identify the shortcomings of current evaluation measures. Secondly, we present a new evaluation setting by assessing participating systems' performance according to the skewness of target words' distribution of senses showing that there are methods able to perform well above the Most Frequent Sense (MFS) baseline in highly skewed distributions.