Class-based probability estimation using a semantic hierarchy

Authors:
Stephen Clark; David Weir
Affiliations:
Division of Informatics, University of Edinburgh, 2 Buccleuch Place, Edinburgh, EH8 9LW, UK; School of Cognitive and Computing Sciences, University of Sussex, Brighton, BN1 9QH, UK
Venue:
Computational Linguistics
Year:
2002


Abstract

This article concerns the estimation of a particular kind of probability, namely, the probability of a noun sense appearing as a particular argument of a predicate. In order to overcome the accompanying sparse-data problem, the proposal here is to define the probabilities in terms of senses from a semantic hierarchy and exploit the fact that the senses can be grouped into classes consisting of semantically similar senses. There is a particular focus on the problem of how to determine a suitable class for a given sense, or, alternatively, how to determine a suitable level of generalization in the hierarchy. A procedure is developed that uses a chi-square test to determine a suitable level of generalization. In order to test the performance of the estimation method, a pseudo-disambiguation task is used, together with two alternative estimation methods. Each method uses a different generalization procedure; the first alternative uses the minimum description length principle, and the second uses Resnik's measure of selectional preference. In addition, the performance of our method is investigated using both the standard Pearson chi-square statistic and the log-likelihood chi-square statistic.
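The two statistics named at the end of the abstract can be illustrated with a minimal sketch. The contingency-table layout used here (one row per candidate class, with columns counting how often the class does and does not occur as the argument of the predicate) is an assumption made for illustration and is not taken from the abstract; the function names are likewise hypothetical. Only the formulas for the Pearson chi-square statistic and the log-likelihood chi-square statistic (often written G²) are standard.

```python
import math


def expected_counts(table):
    """Expected cell counts under independence of rows and columns."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    return [[r * c / total for c in col_totals] for r in row_totals]


def pearson_chi_square(table):
    """Pearson statistic: sum over cells of (O - E)^2 / E."""
    expected = expected_counts(table)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )


def log_likelihood_chi_square(table):
    """Log-likelihood statistic: 2 * sum over cells of O * ln(O / E).

    Cells with an observed count of zero contribute nothing to the sum.
    """
    expected = expected_counts(table)
    return 2.0 * sum(
        o * math.log(o / e)
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
        if o > 0
    )


def degrees_of_freedom(table):
    """Degrees of freedom for an r x c table: (r - 1) * (c - 1)."""
    return (len(table) - 1) * (len(table[0]) - 1)


# Hypothetical counts: two classes, each seen 30 times in total, with
# different rates of occurrence as the argument of the predicate.
counts = [[10, 20], [20, 10]]
x2 = pearson_chi_square(counts)
g2 = log_likelihood_chi_square(counts)
df = degrees_of_freedom(counts)
```

Under one plausible reading of the abstract, a statistic below the critical value for the given degrees of freedom would mean the classes behave similarly enough to be merged at a higher level of the hierarchy, while a larger value would argue for stopping at the current level. The significance level is a free parameter of such a test.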