KU: word sense disambiguation by substitution

Authors:
Deniz Yuret
Affiliations:
Koç University, Istanbul, Turkey
Venue:
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Year:
2007

Citing 7
Cited 18

Using corpus statistics and WordNet relations for sense identification

Computational Linguistics - Special issue on word sense disambiguation
Evaluating sense disambiguation across diverse parameter spaces

Natural Language Engineering
An empirical study of smoothing techniques for language modeling

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
OntoNotes: the 90% solution

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
SemEval-2007 task 06: word-sense disambiguation of prepositions

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
SemEval-2007 task 10: English lexical substitution task

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
SemEval-2007 task 17: English lexical sample, SRL and all words

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations

Smoothing a tera-word language model

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Acquiring knowledge from the web to be used as selectors for noun sense disambiguation

CoNLL '08 Proceedings of the Twelfth Conference on Computational Natural Language Learning
Prepositions in applications: A survey and introduction to the special issue

Computational Linguistics
Disambiguation of preposition sense using linguistically motivated features

SRWS '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
Using web selectors for the disambiguation of all words

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Modeling morphologically rich languages using split words and unstructured dependencies

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
The noisy channel model for unsupervised word sense disambiguation

Computational Linguistics
A semantic lexicon-based approach for sense disambiguation and its WWW application

ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
Evaluation metrics for the lexical substitution task

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
HMMs, GRs, and n-grams as lexical substitution techniques: are they portable to other languages?

MCTLLL '09 Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning
COLEUR and COLSLM: A WSD approach to multilingual lexical substitution, tasks 2 and 3 SemEval 2010

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Unsupervised part of speech tagging using unambiguous substitutes from a statistical language model

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
An efficient indexer for large N-gram corpora

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations
Compositional expectation: a purely distributional model of compositional semantics

IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics
A part-of-speech lexicographic encoding for an evolutionary word sense disambiguation approach

EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Latent vector weighting for word meaning in context

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Computational approaches to sentence completion

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A challenge set for advancing language modeling

WLM '12 Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data sparsity is one of the main factors that make word sense disambiguation (WSD) difficult. To overcome this problem we need to find effective ways to use resources other than sense labeled data. In this paper I describe a WSD system that uses a statistical language model based on a large unannotated corpus. The model is used to evaluate the likelihood of various substitutes for a word in a given context. These likelihoods are then used to determine the best sense for the word in novel contexts. The resulting system participated in three tasks in the SemEval 2007 workshop. The WSD of prepositions task proved to be challenging for the system, possibly illustrating some of its limitations: e.g. not all words have good substitutes. The system achieved promising results for the English lexical sample and English lexical substitution tasks.