Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Explorations in Automatic Thesaurus Discovery
Explorations in Automatic Thesaurus Discovery
Class-based probability estimation using a semantic hierarchy
Computational Linguistics
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
Applied morphological processing of English
Natural Language Engineering
TnT: a statistical part-of-speech tagger
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A maximum entropy approach to identifying sentence boundaries
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Word-sense disambiguation using statistical models of Roget's categories trained on large corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Investigating GIS and smoothing for maximum entropy taggers
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Unsupervised methods for developing taxonomies by combining syntactic and statistical information
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Chunking with maximum entropy models
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Boosting automatic lexical acquisition with morphological information
ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
Improvements in automatic thesaurus extraction
ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
Supersense tagging of unknown nouns in WordNet
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Hierarchical semantic classification: word sense disambiguation with world knowledge
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Unknown word sense detection as outlier detection
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Faceted search and retrieval based on semantically annotated product family ontology
Proceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval
Methodological Review: Empirical distributional semantics: Methods and biomedical applications
Journal of Biomedical Informatics
An empirical study on class-based word sense disambiguation
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Bootstrapping distributional feature vector quality
Computational Linguistics
GPLSI: word coarse-grained disambiguation aided by basic level concepts
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Information Processing and Management: an International Journal
GPLSI-IXA: Using semantic classes to acquire monosemous training examples from domain texts
SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Semantic classification of automatically acquired nouns using lexico-syntactic clues
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Combining contextual and structural information for supersense tagging of chinese unknown words
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Induction of Semantic Classes Based on Coordinate Patterns
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Learning semantics and selectional preference of adjective-noun pairs
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Regular polysemy: a distributional model
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Coarse lexical semantic annotation with supersenses: an Arabic case study
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Predicting part-of-speech tags and morpho-syntactic relations using similarity-based technique
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Hi-index | 0.00 |
The limited coverage of lexical-semantic resources is a significant problem for NLP systems which can be alleviated by automatically classifying the unknown words. Supersense tagging assigns unknown nouns one of 26 broad semantic categories used by lexicographers to organise their manual insertion into WORDNET. Ciaramita and Johnson (2003) present a tagger which uses synonym set glosses as annotated training examples. We describe an unsupervised approach, based on vector-space similarity, which does not require annotated examples but significantly outperforms their tagger. We also demonstrate the use of an extremely large shallow-parsed corpus for calculating vector-space semantic similarity.