Unknown word sense detection as outlier detection

Authors:
Katrin Erk
Affiliations:
Saarland University Saarbrücken, Germany
Venue:
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Year:
2006

Citing 12
Cited 4

Outlier Detection Using Classifier Instability

SSPR '98/SPR '98 Proceedings of the Joint IAPR International Workshops on Advances in Pattern Recognition
Parzen-Window Network Intrusion Detectors

ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 4 - Volume 4
Neural Networks for Novelty Detection in Airframe Strain Data

IJCNN '00 Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 6 - Volume 6
Novelty detection: a review—part 1: statistical approaches

Signal Processing
Novelty detection: a review—part 2: neural network based approaches

Signal Processing
Automatic word sense discrimination

Computational Linguistics - Special issue on word sense disambiguation
Combining Classifiers for word sense disambiguation

Natural Language Engineering
The Berkeley FrameNet Project

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Principle-based parsing without overgeneration

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Unsupervised methods for developing taxonomies by combining syntactic and statistical information

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Supersense tagging of unknown nouns using semantic similarity

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A semantic approach to recognizing textual entailment

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing

Learning model order from labeled and unlabeled data for partially supervised classification, with application to word sense disambiguation

Computer Speech and Language
The framenet model and its applications†

Natural Language Engineering
Using language modeling to select useful annotation data

SRWS '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
A distributional similarity approach to the detection of semantic change in the Google Books Ngram corpus

GEMS '11 Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We address the problem of unknown word sense detection: the identification of corpus occurrences that are not covered by a given sense inventory. We model this as an instance of outlier detection, using a simple nearest neighbor-based approach to measuring the resemblance of a new item to a training set. In combination with a method that alleviates data sparseness by sharing training data across lemmas, the approach achieves a precision of 0.77 and recall of 0.82.