A quantitative evaluation of global word sense induction

Authors:
Marianna Apidianaki;Tim Van de Cruys
Affiliations:
Alpage, INRIA & University Paris 7, Domaine de Voluceau Rocquencourt, Le Chesnay cedex;Alpage, INRIA & University Paris 7, Domaine de Voluceau Rocquencourt, Le Chesnay cedex
Venue:
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Year:
2011

Citing 16
Cited 2

Word association norms, mutual information, and lexicography

Computational Linguistics
Discovering word senses from text

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic word sense discrimination

Computational Linguistics - Special issue on word sense disambiguation
Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
A graph model for unsupervised lexical acquisition

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Feature-rich part-of-speech tagging with a cyclic dependency network

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Enriching the knowledge sources used in a maximum entropy part-of-speech tagger

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Word sense disambiguation: A survey

ACM Computing Surveys (CSUR)
Using three way data for word sense discrimination

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Two graph-based algorithms for state-of-the-art WSD

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
OntoNotes: the 90% solution

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Semeval-2007 task 02: evaluating word sense induction and discrimination systems

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
SemEval-2010 task 14: evaluation setting for word sense induction & disambiguation systems

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
The role of named entities in web people search

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
SemEval-2010 task 14: Word sense induction & disambiguation

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Duluth-WSI: SenseClusters applied to the sense induction task of SemEval-2

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation

Latent semantic word sense induction and disambiguation

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
MaxMax: a graph-based soft clustering algorithm applied to word sense induction

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

Word sense induction (WSI) is the task aimed at automatically identifying the senses of words in texts, without the need for handcrafted resources or annotated data. Up till now, most WSI algorithms extract the different senses of a word 'locally' on a per-word basis, i.e. the different senses for each word are determined separately. In this paper, we compare the performance of such algorithms to an algorithm that uses a 'global' approach, i.e. the different senses of a particular word are determined by comparing them to, and demarcating them from, the senses of other words in a full-blown word space model. We adopt the evaluation framework proposed in the SemEval-2010 Word Sense Induction & Disambiguation task. All systems that participated in this task use a local scheme for determining the different senses of a word. We compare their results to the ones obtained by the global approach, and discuss the advantages and weaknesses of both approaches.